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SEQUENCE LISTING 

<110> Yocum, R. et al. 

<120> MICROORGANISMS AND ASSAYS FOR THE IDENTIFICATION OF 
ANTIBIOTICS 

<130> OGZ-001 

<140> 
<141> 

<150> US 60/227,860 
<151> 2000-08-24 



<160> 76 

<170> Patentln Ver. 2.0 

<210> 1 
<211> 777 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (774) 

<400> 1 

ttg tta ctg gtt ate gat gtg ggg aac acc aat act gta ctt ggt gta 4 8 

Leu Leu Leu Val lie Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 

15 10 15 

tat cat gat gga aaa tta gaa tat cac tgg cgt ata gaa aca age agg 96 
Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg lie Glu Thr Ser Arg 
20 25 30 

cat aaa aca gaa gat gag ttt ggg atg att ttg cgc tec tta ttt gat 14 4 
His Lys Thr Glu Asp Glu Phe Gly Met lie Leu Arg Ser Leu Phe Asp 
35 40 45 

cac tec ggg ctt atg ttt gaa cag ata gat ggc att att att teg tea 192 
His Ser Gly Leu Met Phe Glu Gin lie Asp Gly lie lie lie Ser Ser 
50 55 60 

gta gtg ccg cca ate atg ttt gcg tta gaa aga atg tgc aca aaa tac 240 
Val Val Pro Pro lie Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 

ttt cat ate gag cct caa att gtt ggt cca ggt atg aaa acc ggt tta 288 
Phe His lie Glu Pro Gin lie Val Gly Pro Gly Met Lys Thr Gly Leu 
85 90 95 

aat ata aaa tat gac aat ccg aaa gaa gta ggg gca gac aga ate gta 336 
Asn lie Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg lie Val 
100 105 110 

aat get gtc get gcg ata cac trg tac ggc aat cca tta att gtt gtc 384 
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Asn Ala Val Ala Ala He His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 125 

gat ttc gga acc gcc aca acg tac tgc tat att gat gaa aac aaa caa 432 
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr He Asp Glu Asn Lys Gin 
130 135 140 

tac atg ggc ggg gcg att gcc cct ggg att aca att teg aca gag gcg 4 80 
Tyr Met Gly Gly Ala He Ala Pro Gly He Thr He Ser Thr Glu Ala 
145 150 155 160 

ctt tac teg cgt gca gca aag ctt cct cgt ate gaa ate acc egg ccc 
Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg He Glu He Thr Arg Pro 
165 170 175 

gac aat att ate gga aaa aac act gtt age gcg atg caa tct gga att 
Asp Asn He He Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly He 
180 185 190 

tta ttt ggc tat gtc ggc caa gtg gaa gga ate gtt aag cga atg aaa 
Leu Phe Gly Tyr Val Gly Gin Val Glu Gly He Val Lys Arg Met Lys 
195 200 205 

tgg cag gca aaa cag gac etc aag gtc att gcg aca gga ggc ctg gcg 672 
Trp Gin Ala Lys Gin Asp Leu Lys Val lie Ala Thr Gly Gly Leu Ala 
210 215 220 



ccg etc att gcg aac gaa tea gat tgt ata gac ate gtt gat cca ttc 
Pro Leu lie Ala Asn Glu Ser Asp Cys lie Asp lie Val Asp Pro Phe 
225 230 235 240 

tta acc eta aaa ggg ctg gaa ttg att tat gaa aga aac cgc gta gga 
Leu Thr Leu Lys Gly Leu Glu Leu He Tyr Glu Arg Asn Arg Val Gly 
245 250 255 

agt gta tag 
Ser Val 



<210> 2 
<211> 258 
<212> PRT 

<213> Bacillus subtilis 
<400> 2 

Leu Leu Leu Val He Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
15 10 15 

Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg He Glu Thr Ser Arg 
20 25 30 

His Lys Thr Glu Asp Glu Phe Gly Met lie Leu Arg Ser Leu Phe Asp 
35 40 45 

His Ser Gly Leu Met Phe Glu Gin He Asp Gly He He He Ser Ser 
50 55 60 

Val Val Pro Pro He Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 

Phe His He Glu Pro Gin He Val Gly Pro Gly Met Lys Thr Gly Leu 



528 



57 6 



624 



720 



768 



777 
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85 



90 



95 



Asn He Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg He Val 
100 105 HO 

Asn Ala Val Ala Ala He His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr lie Asp Glu Asn Lys Gin 
130 135 140 

Tyr Met Gly Gly Ala He Ala Pro Gly He Thr He Ser Thr Glu Ala 
145 150 155 160 

Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg He Glu He Thr Arg Pro 
165 170 175 

Asp Asn lie lie Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly He 
180 185 190 

Leu Phe Gly Tyr Val Gly Gin Val Glu Gly He Val Lys Arg Met Lys 
195 200 205 

Trp Gin Ala Lys Gin Asp Leu Lys Val He Ala Thr Gly Gly Leu Ala 
210 215 220 

Pro Leu He Ala Asn Glu Ser Asp Cys He Asp He Val Asp Pro Phe 
225 230 235 240 

Leu Thr Leu Lys Gly Leu Glu Leu He Tyr Glu Arg Asn Arg Val Gly 
245 250 255 



Ser Val 



<210> 3 
<211> 250 
<212> PRT 

<213> Clostridium acetobutylicum 
<400> 3 

Asn Lys Arg Ala Ala Phe Met Leu Leu Leu Phe Leu Arg Ser Val Leu 
15 10 15 

Lys Val He Leu Val Leu Asp Val Gly Asn Thr Asn He Val Leu Gly 
20 25 30 

He Tyr Asn Asp Thr Lys Leu Thr Ala Glu Trp Arg Leu Ser Thr Asp 
35 40 45 

Val Leu Arg Ser Ala Asp Glu Tyr Gly He Gin Val Met Asn Leu Phe 
50 55 60 

Gin Gin Asp Lys Leu Asp Pro Thr Leu Val Glu Gly Val He He Ser 
65 70 75 80 

Ser Val Val Pro Asn He Met Tyr Ser Leu Glu His Met He Arg Lys 
85 90 95 

Tyr Phe Lys He Asn Pro Leu Val Val Gly Pro Gly He Lys Thr Gly 
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100 

lie Asn lie Lys 
115 

Val Asn Ala Val 
130 

lie Asp Phe Gly 
145 

Asp Tyr Leu Gly 



Ala Leu Phe Glu 
180 



Pro Ala Tyr Ala 
195 

lie Val Tyr Arg 
210 

Lys Glu Asn Leu 
225 

Ala Thr Gly Gly 



Tyr Asp Asn Pro 
120 

Ala Ala His Glu 
135 

Thr Ala Thr Thr 
150 

Gly Ala He Cys 
165 

Lys Ala Ala Lys 



lie Cys Lys Asn 
200 

Tyr Leu Arg Gin 
215 

Pro Asp Gly Arg 
230 

Leu Ala Lys Leu 
245 



105 

Lys Glu Val Gly 



He Tyr Lys Arg 
140 

Phe Cys Ala Val 
155 

Pro Gly lie Lys 
170 

Leu Pro Arg Val 
185 

Thr He Ser Ser 



Val Lys Tyr Leu 
220 

Arg Thr Arg Thr 
235 

He Asn 
250 



110 

Ala Asp Arg He 
125 

Ser Leu He He 



Arg Glu Asn Gly 
160 

Val Ser Ser Glu 
175 

Glu Leu He Lys 
190 

He Gin Ser Gly 
205 

Phe Glu Lys Leu 



Ser Leu Val Leu 
240 



<210> 4 
<211> 265 
<212> PRT 

<213> Streptomyces coelicolor 
<400> 4 

Met Leu Leu Thr He Asp Val Gly Asn Thr His Thr Val Leu Gly Leu 
15 10 15 

Phe Asp Gly Glu Asp He Val Glu His Trp Arg He Ser Thr Asp Ser 
20 25 30 

Arg Arg Thr Ala Asp Glu Leu Ala Val Leu Leu Gin Gly Leu Met Gly 
35 40 45 

Met His Pro Leu Leu Gly Asp Glu Leu Gly Asp Gly He Asp Gly He 
50 55 60 

Ala He Cys Ala Thr Val Pro Ser Val Leu His Glu Leu Arg Glu Val 
65 70 75 80 

Thr Arg Arg Tyr Tyr Gly Asp Val Pro Ala Val Leu Val Glu Pro Gly 
85 90 95 

Val Lys Thr Gly Val Pro He Leu Thr Asp His Pro Lys Glu Val Gly 
100 105 HO 

Ala Asp Arg He He Asn Ala Val Ala Ala Val Glu Leu Tyr Gly Gly 
115 120 125 

Pro Ala He Val Val Asp Phe Gly Thr Ala Thr Thr Phe Asp Ala Val 
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130 



135 140 



Ser Ala Arg Gly Glu Tyr He Gly Gly Val He Ala Pro Gly He Glu 
145 150 155 160 

He Ser Val Glu Ala Leu Gly Val Lys Gly Ala Gin Leu Arg Lys He 
165 170 175 

Glu Val Ala Arg Pro Arg Ser Val He Gly Lys Asn Thr Val Glu Ala 
180 185 190 

Met Gin Ser Gly He Val Tyr Gly Phe Ala Gly Gin Val Asp Gly Val 
195 200 205 

Val Asn Arg Met Ala Arg Glu Leu Ala Asp Asp Pro Asp Asp Val Thr 
210 215 220 

Val He Ala Thr Gly Gly Leu Ala Pro Met Val Leu Gly Glu Ser Ser 
225 230 235 240 

Val He Asp Glu His Glu Pro Trp Leu Thr Leu Met Gly Leu Arg Leu 
245 250 255 

Val Tyr Glu Arg Asn Val Ser Arg Met 
260 265 



<210> 5 
<211> 272 
<212> PRT 

<213> Mycobacterium tuberculosis 
<4 00> 5 

Met Leu Leu Ala He Asp Val Arg Asn Thr His Thr Val Val Gly Leu 
15 10 15 

Leu Ser Gly Met Lys Glu His Ala Lys Val Val Gin Gin Trp Arg He 
20 25 30 

Arg Thr Glu Ser Glu Val Thr Ala Asp Glu Leu Ala Leu Thr He Asp 
35 40 45 

Gly Leu He Gly Glu Asp Ser Glu Arg Leu Thr Gly Thr Ala Ala Leu 
50 55 60 

Ser Thr Val Pro Ser Val Leu His Glu Val Arg He Met Leu Asp Gin 
65 70 75 80 

Tvr Trp Pro Ser Val Pro His Val Leu He Glu Pro Gly Val Arg Thr 
85 90 95 

Gly He Pro Leu Leu Val Asp Asn Pro Lys Glu Val Gly Ala Asp Arg 
100 105 HO 

He Val Asn Cys Leu Ala Ala Tyr Asp Arg Phe Arg Lys Ala Ala He 
115 120 125 

Val Val Asp Phe Gly Ser Ser He Cys Val Asp Val Val Ser Ala Lys 
130 135 140 

Gly Glu Phe Leu Gly Gly Ala He Ala Pro Gly Val Gin Val Ser Ser 
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145 



150 155 160 



Asp Ala Ala Ala Ala Arg Ser Ala Ala Leu Arg Arg Val Glu Leu Ala 
165 170 175 

Arg Pro Arg Ser Val Val Gly Lys Asn Thr Val Glu Cys Met Gin Ala 
180 185 190 

Gly Ala Val Phe Gly Phe Ala Gly Leu Val Asp Gly Leu Val Gly Arg 
195 200 205 

He Arg Glu Asp Val Ser Gly Phe Ser Val Asp His Asp Val Ala He 
210 215 220 

Val Ala Thr Gly His Thr Ala Pro Leu Leu Leu Pro Glu Leu His Thr 
225 230 235 240 

Val Asp His Tyr Asp Gin His Leu Thr Leu Gin Gly Leu Arg Leu Val 
245 250 255 

Phe Glu Arg Asn Leu Glu Val Gin Arg Gly Arg Leu Lys Thr Ala Arg 
260 265 270 



<210> 6 
<211> 258 
<212> PRT 

<213> Rhodobacter capsulatus 
<400> 6 

Met Leu Leu Cys He Asp Cys Gly Asn Thr Asn Thr Val Phe Ser Val 
15 10 15 

Trp Asp Gly Thr Asp Phe Ala Ala Thr Trp Arg He Ala Thr Asp His 
20 25 30 

Arg Arg Thr Ala Asp Glu Tyr Phe Val Trp Leu Asn Thr Leu Met Gin 
35 40 45 

Leu Lys Gly Leu Gin Gly Arg He Ser Glu Ala He He Ser Ser Thr 
50 55 60 

Ala Pro Arg Val Val Phe Asn Leu Arg Val Leu Cys Asn Arg Tyr Phe 
65 70 75 80 

Asp Cys Arg Pro Tyr Val Val Gly Lys Pro Gly Cys Glu Leu Pro Val 
85 90 95 

Ala Pro Arg Val Asp Pro Gly Thr Thr Val Gly Pro Asp Arg Leu Val 
100 105 HO 

Asn Thr Val Ala Gly Tyr Asp Arg His Gly Gly Asp Leu He Val Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Phe Asp Val Val Ala Pro Asp Gly Ala 
130 135 140 

Tyr He Gly Gly Val He Ala Pro Gly Val Asn Leu Ser Leu Glu Ala 
145 150 155 160 

Leu His Met Ala Ala Ala Ala Leu Pro His Val Asp Val Thr Lys Pro 
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Gin Gly Val He 
180 

Tyr Trp Gly Tyr 
195 

Met Glu Arg Asp 
210 

Ser Leu Phe Asp 
225 

Leu Thr Met His 



165 

Gly Thr Asn Thr 



He Gly Leu Val 
200 

Arg Pro Met Lys 
215 

Leu Gly Phe Asp 
230 

Gly Leu Arg Leu 
245 



170 

Val Ala Cys He 
185 

Glu Gly He Val 



Val He Ala Thr 
220 

Leu Phe Asp Lys 
235 

He Phe Asp Tyr 

250 



175 

Gin Ser Gly Val 
190 

Arg Gin He Arg 
205 

Gly Gly Leu Ala 



Val Glu Asp Asp 
240 

Asn Lys Gly Leu 
255 



Gly Ala 



<210> 7 
<211> 255 
<212> PRT 

<213> Geobacter sulf urreducens 
<400> 7 

Met Leu Leu Val He Asp Val Gly Asn Thr Asn He Val Leu Gly He 
15 10 15 

Tyr Asp Gly Glu Arg Leu Val Arg Asp Trp Arg Val Ser Thr Asp Lys 
20 25 30 

Ala Arg Thr Thr Asp Glu Tyr Gly He Leu He Asn Glu Leu Phe Arg 
35 40 45 

Leu Ala Gly Leu Gly Leu Asp Gin He Arg Ala Val He He Ser Ser 
50 55 60 

Val Val Pro Pro Leu Thr Gly Val Leu Glu Arg Leu Ser Leu Gly Tyr 
65 70 75 80 

Phe Gly Met Arg Pro Leu Val Val Gly Pro Gly He Lys Thr Gly Met 
85 90 95 

Pro He Gin Tyr Asp Asn Pro Arg Glu Val Gly Ala Asp Arg He Val 
100 105 110 

Asn Ala Val Ala Gly Tyr Glu Lys Tyr Arg Thr Ser Leu He He Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Phe Asp Tyr Val Asn Arg Lys Gly Glu 
130 135 140 

Tyr Cys Gly Gly Ala He Ala Pro Gly Leu Val He Ser Thr Glu Ala 
145 150 155 160 

Leu Phe Gin Arg Ala Ser Lys Leu Pro Arg Val Asp He He Arg Pro 
165 170 175 



Ser Ala He He Ala Arg Asn Thr Val Asn Ser Met Gin Ala Gly lie 
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180 

Tyr Tyr Gly Tyr 
195 

Ala Glu Ser Lys 
210 

Ser Leu lie Ala 
225 

Leu Thr Leu Glu 



Val Gly Leu Val 
200 

Asp Ala Pro Arg 
215 

Pro Glu Ser Lys 
230 

Gly Leu Arg lie 
245 



185 

Asp Glu He Val 



Val He Ala Thr 
220 

Thr lie Glu Ala 
235 

Leu Tyr Glu Arg 
250 



190 

Thr Arg Met Lys 
205 

Gly Gly Leu Ala 



Val Glu Glu Tyr 
240 

Asn Arg Glu 
255 



<210> 8 
<211> 262 
<212> PRT 

<213> Deinococcus radiopugnans 
<400> 8 

Met Pro Ala Phe Pro Leu Leu Ala Val Asp lie Gly Asn Thr Thr Thr 
15 10 15 

Val Leu Gly Leu Ala Asp Ala Ser Gly Ala Leu Thr His Thr Trp Arg 
20 25 30 

He Arg Thr Asn Arg Glu Met Leu Pro Asp Asp Leu Ala Leu Gin Leu 
35 40 45 

His Gly Leu Phe Thr Leu Ala Gly Ala Pro lie Pro Arg Ala Ala Val 
50 55 60 

Leu Ser Ser Val Ala Pro Pro Val Gly Glu Asn Tyr Ala Leu Ala Leu 
65 70 75 80 

Lys Arg His Phe Met He Asp Ala Phe Ala Val Ser Ala Glu Asn Leu 
85 90 95 

Pro Asp Val Thr Val Glu Leu Asp Thr Pro Gly Ser Val Gly Ala Asp 
100 105 110 

Arg Leu Cys Asn Leu Phe Gly Ala Glu Lys Tyr Leu Gly Gly Leu Asp 
115 120 125 

Tyr Ala Val Val Val Asp Phe Gly Thr Ser Thr Asn Phe Asp Val Val 
130 135 140 

Gly Arg Gly Arg Arg Phe Leu Gly Gly He Leu Ala Thr Gly Ala Gin 
145 150 155 160 

Val Ser Ala Asp Ala Leu Phe Ala Arg Ala Ala Lys Leu Pro Arg He 
165 170 175 

Thr Leu Gin Ala Pro Glu Thr Ala He Gly Lys Asn Thr Val His Ala 
180 185 190 

Leu Gin Ser Gly Leu Val Phe Gly Tyr Ala Glu Met Val Asp Gly Leu 
195 200 205 



Leu Arg Arg lie Arg Ala Glu Leu Pro Gly Glu Ala Val Ala Val Ala 
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210 215 220 

Thr Gly Gly Phe Ser Arg Thr Val Gin Gly lie Cys Gin Glu lie Asp 
225 230 235 240 

Tyr Tyr Asp Glu Thr Leu Thr Leu Arg Gly Leu Val Glu Leu Trp Ala 
245 250 255 

Ser Arg Ser Glu Val Arg 
260 



<210> 9 
<211> 246 
<212> PRT 

<213> Thermotoga maritima 
<400> 9 

Met Tyr Leu Leu Val Asp Val Gly Asn Thr His Ser Val Phe Ser lie 
15 10 15 

Thr Glu Asp Gly Lys Thr Phe Arg Arg Trp Arg Leu Ser Thr Gly Val 
20 25 30 

Phe Gin Thr Glu Asp Glu Leu Phe Ser His Leu His Pro Leu Leu Gly 
35 40 45 

Asp Ala Met Arg Glu lie Lys Gly lie Gly Val Ala Ser Val Val Pro 
50 55 60 

Thr Gin Asn Thr Val lie Glu Arg Phe Ser Gin Lys Tyr Phe His lie 
65 70 75 80 

Ser Pro lie Trp Val Lys Ala Lys Asn Gly Cys Val Lys Trp Asn Val 
85 90 95 

Lys Asn Pro Ser Glu Val Gly Ala Asp Arg Val Ala Asn Val Val Ala 
100 105 110 

Phe Val Lys Glu Tyr Gly Lys Asn Gly lie lie lie Asp Met Gly Thr 
115 120 125 

Ala Thr Thr Val Asp Leu Val Val Asn Gly Ser Tyr Glu Gly Gly Ala 
130 135 140 

lie Leu Pro Gly Phe Phe Met Met Val His Ser Leu Phe Arg Gly Thr 
145 150 155 160 

Ala Lys Leu Pro Leu Val Glu Val Lys Pro Ala Asp Phe Val Val Gly 
165 170 175 

Lys Asp Thr Glu Glu Asn lie Arg Leu Gly Val Val Asn Gly Ser Val 
180 185 190 

Tyr Ala Leu Glu Gly lie lie Gly Arg lie Lys Glu Val Tyr Gly Asp 
195 200 205 

Leu Pro Val Val Leu Thr Gly Gly Gin Ser Lys lie Val Lys Asp Met 
210 215 220 



lie Lys His Glu lie Phe Asp Glu Asp Leu Thr lie Lys Gly Val Tyr 
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225 230 235 240 

His Phe Cys Phe Gly Asp 
245 



<210> 10 
<211> 273 
<212> PRT 

<213> Treponema pallidum 
<400> 10 

Met Leu Leu lie Asp Val Gly Asn Ser His Val Val Phe Gly lie Gin 
15 10 15 

Gly Glu Asn Gly Gly Arg Val Cys Val Arg Glu Leu Phe Arg Leu Ala 
20 25 30 

Pro Asp Ala Arg Lys Thr Gin Asp Glu Tyr Ser Leu Leu lie His Ala 
35 40 45 

Leu Cys Glu Arg Ala Gly Val Gly Arg Ala Ser Leu Arg Asp Ala Phe 
50 55 60 

lie Ser Ser Val Val Pro Val Leu Thr Lys Thr lie Ala Asp Ala Val 
65 70 75 80 

Ala Gin lie Ser Gly Val Gin Pro Val Val Phe Gly Pro Trp Ala Tyr 
85 90 95 

Glu His Lea Pro Val Arg lie Pro Glu Pro Val Arg Ala Glu He Gly 
100 105 110 

Thr Asp Leu Val Ala Asn Ala Val Ala Ala Tyr Val His Phe Arg Ser 
115 120 125 

Ala Cys Val Val Val Asp Cys Gly Thr Ala Leu Thr Phe Thr Ala Val 
130 135 140 

Asp Gly Thr Gly Leu lie Gin Gly Val Ala He Ala Pro Gly Leu Arg 
145 150 155 160 

Thr Ala Val Gin Ser Leu His Thr Gly Thr Ala Gin Leu Pro Leu Val 
165 170 175 

Pro Leu Ala Leu Pro Asp Ser Val Leu Gly Lys Asp Thr Thr His Ala 
180 185 190 

Val Gin Ala Gly Val Val Arg Gly Thr Leu Phe Val lie Arg Ala Met 
195 200 205 

lie Ala Gin Cys Gin Lys Glu Leu Gly Cys Arg Cys Ala Ala Val lie 
210 215 220 

Thr Gly Gly Leu Ser Arg Leu Phe Ser Ser Glu Val Asp Phe Pro Pro 
225 230 235 240 

lie Asp Ala Gin Leu Thr Leu Ser Gly Leu Ala His He Ala Arg Leu 
245 250 255 



Val Pro Thr Ser Leu Leu Pro Pro Ala Thr Val Ser Gly Ser Ser Gly 
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260 265 270 

Asn 



<210> 11 
<211> 262 
<212> PRT 

<213> Borrelia burgdorferi 
<400> 11 

Met Asn Lys Pro Leu Leu Ser Glu Leu lie lie Asp lie Gly Asn Thr 
15 10 15 

Ser lie Ala Phe Ala Leu Phe Lys Asp Asn Gin Val Asn Leu Phe lie 
20 25 30 

Lys Met Lys Thr Asn Leu Met Leu Arg Tyr Asp Glu Val Tyr Ser Phe 
35 4 0 4 5 

Phe Glu Glu Asn Phe Asp Phe Asn Val Asn Lys Val Phe lie Ser Ser 
50 55 60 

Val Val Pro lie Leu Asn Glu Thr Phe Lys Asn Val lie Phe Ser Phe 
65 70 75 80 

Phe Lys lie Lys Pro Leu Phe lie Gly Phe Asp Leu Asn Tyr Asp Leu 
85 90 95 

Thr Phe Asn Pro Tyr Lys Ser Asp Lys Phe Leu Leu Gly Ser Asp Val 
100 105 110 

Phe Ala Asn Leu Val Ala Ala lie Glu Asn Tyr Ser Phe Glu Asn Val 
115 120 125 

Leu Val Val Asp Leu Gly Thr Ala Cys Thr lie Phe Ala Val Ser Arg 
130 135 140 

Gin Asp Gly He Leu Gly Gly He He Asn Ser Gly Pro Leu He Asn 
145 150 155 160 

Phe Asn Ser Leu Leu Asp Asn Ala Tyr Leu He Lys Lys Phe Pro He 
165 170 175 

Ser Thr Pro Asn Asn Leu Leu Glu Arg Thr Thr Ser Gly Ser Val Asn 
180 185 190 

Ser Gly Leu Phe Tyr Gin Tyr Lys Tyr Leu He Glu Gly Val Tyr Arg 
195 200 205 

Asp lie Lys Gin Met Tyr Lys Lys Lys Phe Asn Leu lie lie Thr Gly 
210 215 220 

Gly Asn Ala Asp Leu He Leu Ser Leu lie Glu He Glu Phe He Phe 
225 230 235 240 

Asn He His Leu Thr Val Glu Gly Val Arg He Leu Gly Asn Ser He 
245 250 255 



Asp Phe Lys Phe Val Asn 
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260 



<210> 12 
<211> 229 
<212> PRT 

<213> Aquifex aeolicus 
<400> 12 

Met Arg Phe Leu Thr Val Asp Val Gly Asn Ser Ser Val Asp lie Ala 
15 10 15 

Leu Trp Glu Gly Lys Lys Val Lys Asp Phe Leu Lys Leu Ser His Glu 
20 25 30 

Glu Phe Leu Lys Glu Glu Phe Pro Lys Leu Lys Ala Leu Gly lie Ser 
35 40 45 

Val Lys Gin Ser Phe Ser Glu Lys Val Arg Gly Lys lie Pro Lys lie 
50 55 60 

Lys Phe Leu Lys Lys Glu Asn Phe Pro lie Gin Val Asp Tyr Lys Thr 
65 70 75 80 

Pro Glu Thr Leu Gly Thr Asp Arg Val Ala Leu Ala Tyr Ser Ala Lys 
85 90 95 

Lys Phe Tyr Gly Lys Asn Val Val Val lie Ser Ala Gly Thr Ala Leu 
100 105 110 

Val lie Asp Leu Val Leu Glu Gly Lys Phe Lys Gly Gly Phe lie Thr 
115 120 125 

Leu Gly Leu Gly Lys Lys Leu Lys lie Leu Ser Asp Leu Ala Glu Gly 
130 135 140 

lie Pro Glu Phe Phe Pro Glu Glu Val Glu lie Phe Leu Gly Arg Ser 
145 150 155 160 

Thr Arg Glu Cys Val Leu Gly Gly Ala Tyr Arg Glu Ser Thr Glu Phe 
165 170 175 

lie Lys Ser Thr Leu Lys Leu Trp Arg Lys Val Phe Lys Arg Lys Phe 
180 185 190 

Lys Val Val lie Thr Gly Gly Glu Gly Lys Tyr Phe Ser Lys Phe Gly 
195 200 205 

lie Tyr Asp Pro Leu Leu Val His Arg Gly Met Arg Asn Leu Leu Tyr 
210 215 220 

Leu Tyr His Arg lie 
225 



<210> 13 
<211> 257 
<212> PRT 

<213> Synechocystis sp . 



<400> 13 
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Met Glu Thr Ser Lys Pro Gly Cys Gly Leu Ala Leu Asp Asn Asp Lys 
15 10 15 

Gin Lys Pro Trp Leu Gly Leu Met lie Gly Asn Ser Arg Leu His Trp 
20 25 30 

Ala Tyr Cys Ser Gly Asn Ala Pro Leu Gin Thr Trp Val Thr Asp Tyr 
35 40 45 

Asn Pro Lys Ser Ala Gin Leu Pro Val Leu Leu Gly Lys Val Pro Leu 
50 55 60 

Met Leu Ala Ser Val Val Pro Glu Gin Thr Glu Val Trp Arg Val Tyr 
65 70 75 80 

Gin Pro Lys lie Leu Thr Leu Lys Asn Leu Pro Leu Val Asn Leu Tyr 
85 90 95 

Pro Ser Phe Gly lie Asp Arcr Ala Leu Ala Gly Leu Gly Thr Gly Leu 
100 105 110 

Thr Tyr Gly Phe Pro Cys Leu Val Val Asp Gly Gly Thr Ala Leu Thr 
115 120 125 

lie Thr Gly Phe Asp Gin Asp Lys Lys Leu Val Gly Gly Ala lie Leu 
130 135 140 

Pro Gly Leu Gly Leu Gin Leu Ala Thr Leu Gly Asp Arg Leu Ala Ala 
145 150 155 160 

Leu Pro Lys Leu Glu Met Asp Gin Leu Thr Glu Leu Pro Asp Arg Trp 
165 170 175 

Ala Leu Asp Thr Pro Ser Ala lie Phe Ser Gly Val Val Tyr Gly Val 
180 185 190 

Leu Gly Ala Leu Gin Ser Tyr Leu Gin Asp Trp Gin Lys Leu Phe Pro 
195 200 205 

Gly Ala Ala Met Val lie Thr Gly Gly Asp Gly Lys lie Leu His Gly 
210 215 220 

Phe Leu Lys Glu His Ser Pro Asn Leu Ser Val Ala Trp Asp Asp Asn 
225 230 235 240 

Leu lie Phe Leu Gly Met Ala Ala lie His His Gly Asp Arg Pro lie 
245 250 255 



Cys 



<210> 14 
<211> 223 
<212> PRT 

<213> Helicobacter pylori 



<400> 14 

Met Pro Ala Arg Gin Ser Phe Thr Asp Leu Lys Asn Leu Val Leu Cys 
15 10 15 
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Asp lie Gly Asn 
20 

Ser Ser Ala Lys 
35 

Phe Tyr lie Ser 
50 

Tyr Pro Asn Ala 
65 

Tyr Val Gly Leu 



Asn Gly Val Val 
100 

Lys Glu Gly Lys 
115 

Tyr lie His Ala 
130 

Lys Ala Leu Asp 
145 

Val Asn Tyr Gly 



Ala Lys Asn Gin 

180 

Ser Ala Phe Leu 
195 

Gly Met Glu lie 
210 



<210> 15 
<211> 267 
<212> PRT 
<213> Bordetell, 

<400> 15 

Met lie lie Leu 

1 

Phe Asp Pro Asp 
20 

Phe Asp Asn Leu 
35 

Pro Arg Arg Pro 
50 



Thr Arg lie His 



Glu Asp Leu Lys 
40 

Val Asn Glu Glu 
55 

Lys Asn lie Ala 
70 

Gly lie Asp Arg 

85 

Val Asp Ala Gly 



His Leu Gly Gly 
120 

Tyr Lys Lys Ser 
135 

Ser Leu Glu Val 
150 

Met Val Leu Ser 
165 

Lys lie Tyr Leu 



Pro His Ser Val 
200 

Ala Leu Lys Lys 
215 



pertussis 



lie Asp Ser Gly 
5 

Ala Pro Gin Ala 



Asp Leu Asp Ala 
40 

Gin Arg Ala Leu 
55 



Phe Ala Gin Asn 
25 

Arg Leu Gly lie 



Asn Glu Lys Ala 
60 

Gly Phe Phe His 
75 

Gin Met Ala Cys 

90 

Ser Ala lie Thr 
105 

Cys lie Leu Pro 



Ala Lys lie Leu 
140 

Leu Pro Lys Ser 
155 

Val lie Ala Cys 
170 

Cys Gly Gly Asp 
185 

Cys Lys Glu Arg 



Ala Gly lie Leu 
220 



Asn Ser Arg Leu 
10 

Ala Arg Glu Pro 
25 

Leu Gly Arg Trp 



Gly Val Asn Val 
60 



Tyr Gin Leu Phe 
30 

Gin Lys Glu lie 
45 

Leu Leu Asn Cys 



Leu Glu Thr Asp 
80 

Leu Ala Val Asn 
95 

lie Asp Leu lie 
110 

Gly Leu Ala Gin 
125 

Glu Gin Pro Phe 



Thr Arg Asp Ala 
160 

lie Gin His Leu 
175 

Ala Lys Tyr Leu 
ISO 

Leu Val Phe Asp 
205 

Glu Cys Lys 



Lys Val Gly Trp 
15 

Ala Pro Val Ala 
30 

Leu Ala Thr Leu 
45 

Ala Gly Leu Ala 



Arg Gly Glu Ala lie Ala Ala Thr Leu Arg Ala Gly Gly Cys Asp He 
65 70 75 80 
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Arg Trp Leu Arg Ala Gin Pro Leu Ala Met Gly Leu Arg Asn Gly Tyr 
85 90 95 

Arg Asn Pro Asp Gin Leu Gly Ala Asp Arg Trp Ala Cys Met Val Gly 
100 105 110 

Val Leu Ala Arg Gin Pro Ser Val His Pro Pro Leu Leu Val Ala Ser 
115 120 125 

Phe Gly Thr Ala Thr Thr Leu Asp Thr lie Gly Pro Asp Asn Val Phe 
130 135 140 

Pro Gly Gly Leu lie Leu Pro Gly Pro Ala Met Met Arg Gly Ala Leu 
145 150 155 160 

Ala Tyr Gly Thr Ala His Leu Pro Leu Ala Asp Gly Leu Val Ala Asp 
165 170 175 

Tyr Pro He Asp Thr His Gin Ala He Ala Ser Gly He Ala Ala Ala 
180 185 190 

Gin Ala Gly Ala He Val Arg Gin Trp Leu Ala Gly Arg Gin Arg Tyr 
195 200 205 

Gly Gin Ala Pro Glu He Tyr Val Ala Gly Gly Gly Trp Pro Glu Val 
210 215 220 

Ara Gin Glu Ala Glu Arg Leu Leu Ala Val Thr Gly Ala Ala Phe Gly 
225 230 235 240 

Ala Thr Pro Gin Pro Thr Tyr Leu Asp Ser Pro Val Leu Asp Gly Leu 
245 250 255 

Ala Ala Leu Ala Ala Gin Gly Ala Pro Thr Ala 
260 265 



<210> 16 
<211> 702 
<212> DNA 

<213> Bacillus subtilis 

<220> 

<221> CDS 

<222> (1) . . (699) 

<400> 16 

ttg tta ctg gtt ate gat gtg ggg aac acc aat act gta ctt ggt gta 4 8 

Met Leu Leu Val He Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
15 10 15 

tat cat gat gga aaa tta gaa tat cac tgg cgt ata gaa aca age agg 96 
Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg He Glu Thr Ser Arg 
20 25 30 

cat aaa aca gaa gat gag ttt ggg atg att ttg cgc tec tta ttt gat 144 
His Lys Thr Glu Asp Glu Phe Gly Met He Leu Arg Ser Leu Phe Asp 
35 40 45 



cac tec ggg ctt atg ttt gaa cag ata gat ggc att att att teg tea 192 
His Ser Gly Leu Met Phe Glu Gin He Asp Gly He He He Ser Ser 
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50 55 60 

gta gtg ccg cca ate atg ttt gcg tta gaa aga atg tgc aca aaa tac 
Val Val Pro Pro lie Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 



cgc tea ttg cga acg aat cag att gta tag 
Arg Ser Leu Arg Thr Asn Gin lie Val 
225 230 



240 



432 



ttt cat ate gag cct caa att gtt ggt cca ggt atg aaa ace ggt tta 288 
Phe His He Glu Pro Gin He Val Gly Pro Gly Met Lys Thr Gly Leu 
85 90 95 

aat ata aaa tat gac aat ccg aaa gaa gta ggg gca gac aga ate gta 336 
Asn He Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg He Val 
100 105 110 

aat get gtc get gcg ata cac ttg tac ggc aat cca tta att gtt gtc 384 
Asn Ala Val Ala Ala He His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 125 

gat ttc gga acc gec aca acg tac tgc tat att gat gaa aac aaa caa 
Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr He Asp Glu Asn Lys Gin 
130 135 140 

tac atg ggc ggg gcg att gec cct ggg att aca att teg aca gag gcg 480 
Tyr Met Gly Gly Ala He Ala Pro Gly He Thr He Ser Thr Glu Ala 
145 150 155 160 

ctt tac teg cgt gca gca aag ctt cct cgt ate gaa ate acc egg ccc 528 
Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg lie Glu lie Thr Arg Pro 
165 170 175 

gac aat att ate gga aaa aac act gtt age gcg atg caa tct gga atr 57 6 
Asp Asn lie lie Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly He 
180 185 190 

tta ttt ggc tat gtc ggc caa gtg gaa gga ate gtt aag cga atg aaa 624 
Leu Phe Gly Tyr Val Gly Gin Val Glu Gly He Val Lys Arg Met Lys 
195 200 205 

tgg cag gca aaa cag gac cca agg tea ttg cga cag gag gee tgg cgc 672 
Trp Gin Ala Lys Gin Asp Pro Arg Ser Leu Arg Gin Glu Ala Trp Arg 
210 215 220 



702 



<210> 17 
<2H> 233 
<212> PRT 

<213> Bacillus subtilis 
<400> 17 

Met Leu Leu Val lie Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
15 10 15 

Tyr His Asp Gly Lys Leu Glu Tyr His Trp Arg He Glu Thr Ser Arg 
20 25 30 



His Lys Thr Glu Asp Glu Phe Gly Met He Leu Arg Ser Leu Phe Asp 
35 40 45 
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His Ser Gly Leu Met Phe Glu Gin lie Asp Gly lie lie lie Ser Ser 
50 55 60 

Val Val Pro Pro lie Met Phe Ala Leu Glu Arg Met Cys Thr Lys Tyr 
65 70 75 80 

Phe His lie Glu Pro Gin lie Val Gly Pro Gly Met Lys Thr Gly Leu 
85 90 95 

Asn lie Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg lie Val 
100 105 110 

Asn Ala Val Ala Ala He His Leu Tyr Gly Asn Pro Leu He Val Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr He Asp Glu Asn Lys Gin 
130 135 140 

Tyr Met Gly Gly Ala He Ala Pro Gly lie Thr He Ser Thr Glu Ala 
145 150 155 160 

Leu Tyr Ser Arg Ala Ala Lys Leu Pro Arg He Glu He Thr Arg Pro 
165 170 175 

Asp Asn He He Gly Lys Asn Thr Val Ser Ala Met Gin Ser Gly He 
180 185 190 

Leu Phe Gly Tyr Val Gly Gin Val Glu Gly lie Val Lys Arg Met Lys 
195 200 205 

Trp Gin Ala Lys Gin Asp Pro Arg Ser Leu Arg Gin Glu Ala Trp Arg 
210 215 220 

Arg Ser Leu Arg Thr Asn Gin He Val 
225 230 



<210> 18 
<211> 163 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal 
<222> (113) . . (118) 

<220> 

<221> -10_signal 
<222> (136) . . (141) 

<400> 18 

gcctacctag cttccaagaa agatatccta acagcacaag agcggaaaga tgttttgttc 60 

tacatccaga acaacctctg ctaaaattcc tgaaaaattt tgcaaaaagt tgttgacttt 120 

atctacaagg tgtggtataa taatcttaac aacagcagga cgc 163 
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<210> 19 
<211> 194 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence : promoter 
sequence 

<220> 

<221> -35_signal 
<222> (136) . . (141) 

<220> 

<221> -10_signal 
<222> (159) . . (164) 

<400> 19 

gctattgacg acagctatgg ttcactgtcc accaaccaaa actgtgctca gtaccgccaa 60 
tatttctccc ttgaggggta caaagaggtg tccctagaag agatccacgc tgtgtaaaaa 120 
ttttacaaaa aggtattgac tttccctaca gggtgtgtaa taatttaatt acaggcgggg 180 
gcaaccccgc ctgt 194 



<210> 20 
<211> 248 
<212> PRT 

<213> Pseudomonas aeruginosa 
<400> 20 

Met lie Leu Glu Leu Asp Cys Gly Asn Ser Leu lie Lys Trp Arg Val 
15 10 15 

lie Glu Gly Ala Ala Arg Ser Val Ala Gly Gly Leu Ala Glu Ser Asp 
20 25 30 

Asp Ala Leu Val Glu Gin Leu Thr Ser Gin Gin Ala Leu Pro Val Arg 
35 40 45 

Ala Cys Arg Leu Val Ser Val Arg Ser Glu Gin Glu Thr Ser Gin Leu 
50 55 60 

Val Ala Arg Leu Glu Gin Leu Phe Pro Val Ser Ala Leu Val Ala Ser 
65 70 75 80 

Ser Gly Lys Gin Leu Ala Gly Val Arg Asn Gly Tyr Leu Asp Tyr Gin 
85 90 95 

Arg Leu Gly Leu Asp Arg Trp Leu Ala Leu Val Ala Ala His His Leu 
100 105 110 

Ala Lys Lys Ala Cys Leu Val lie Asp Leu Gly Thr Ala Val Thr Ser 
115 120 125 

Asp Leu Val Ala Ala Asp Gly Val His Leu Gly Gly Tyr lie Cys Pro 
130 135 140 
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Gly Met Thr Leu 
145 

Arg Tyr Asp Asp 



Gly Gin Ala Thr 
180 

Arg Gly Phe Val 
195 

Pro Asp Cys Glu 
210 

Asp Glu Leu Ala 
225 



Met Arg Ser Gin 
150 

Ala Glu Ala Arg 
165 

Ala Glu Ala Val 



Arg Glu Gin Tyr 
200 

He Phe Leu Thr 
215 

Gly Ala Arg lie 
230 



Leu Arg Thr His 
155 

Arg Ala Leu Ala 
170 

Glu Arg Gly Cys 
185 

Ala Met Ala Cys 



Gly Gly Asp Ala 
220 

Met Pro Asp Leu 
235 



Thr Arg Arg lie 
160 

Ser Leu Gin Pro 
175 

Leu Leu Met Leu 
190 

Glu Leu Leu Gly 
205 

Glu Leu Val Arg 



Val Phe Val Gly 
240 



Leu Ala Leu Ala Cys Pro lie Glu 
245 



<210> 21 
<211> 209 
<212> PRT 

<213> Campylobacter jejuni 
<400> 21 

Met Leu Leu Cys Asp He Gly Asn Ser Asn Ala Asn Phe Leu Asp Asp 
15 10 15 

Asn Lys Tyr Phe Thr Leu Asn He Asp Gin Phe Leu Glu Phe Lys Asn 
20 25 30 

Glu Gin Lys He Phe Tyr He Asn Val Asn Glu His Leu Lys Glu His 
35 40 45 

Leu Lys Asn Gin Lys Asn Phe He Asn Leu Glu Pro Tyr Phe Leu Phe 
50 " 55 60 

Asp Thr He Tyr Gin Gly Leu Gly He Asp Arg He Ala Ala Cys Tyr 
65 70 75 80 

Thr He Glu Asp Gly Val Val Val Asp Ala Gly Ser Ala He Thr He 
85 90 95 

Asp lie lie Ser Asn Ser He His Leu Gly Gly Phe He Leu Pro Gly 
100 105 110 

He Ala Asn Tyr Lys Lys He Tyr Ser His He Ser Pro Arg Leu Lys 
115 120 125 

Ser Glu Phe Asn Thr Gin Val Ser Leu Asp Ala Phe Pro Gin Lys Thr 
130 135 140 

Met Asp Ala Leu Ser Tyr Gly Val Phe Lys Gly He Tyr Leu Leu He 
145 150 155 160 



Lys Asp Ala Ala Gin Asn Lys Lys Leu Tyr Phe Thr Gly Gly Asp Gly 
165 170 175 
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Gln Phe Leu Ala Asn Tyr Phe Asp 
180 

lie Phe Arg Gly Met Lys Lys lie 
195 200 



His Ala lie Tyr Asp Lys Leu Leu 
185 190 

lie Lys Glu Asn Pro Asn Leu Leu 
205 



<210> 22 
<211> 592 
<212> PRT 

<213> Neisseria meningitidis 
<400> 22 

Met Thr Val Leu Lys Pro Ser His Trp Arg Val Leu Ala Glu Leu Ala 
15 10 15 

Asp Gly Leu Pro Gin His Val Ser Gin Leu Ala Arg Met Ala Asp Met 
20 25 30 

Lys Pro Gin Gin Leu Asn Gly Phe Trp Gin Gin Met Pro Ala His lie 
35 40 45 

Arg Gly Leu Leu Arg Gin His Asp Gly Tyr Trp Arg Leu Val Arg Pro 
50 55 60 

Leu Ala Val Phe Asp Ala Glu Gly Leu Arg Glu Leu Gly Glu Arg Ser 

65 70 75 80 

Gly Phe Gin Thr Ala Leu Lys His Glu Cys Ala Ser Ser Asn Asp Glu 
85 90 95 

lie Leu Glu Leu Ala Arg lie Ala Pro Asp Lys Ala His Lys Thr lie 
100 105 110 

Cys Val Thr His Leu Gin Ser Lys Gly Arg Gly Arg Gin Gly Arg Lys 
115 120 125 

Trp Ser His Arg Leu Gly Glu Cys Leu Met Phe Ser Phe Gly Trp Val 
130 135 140 

Phe Asp Arg Pro Gin Tyr Glu Leu Gly Ser Leu Ser Pro Val Ala Ala 
145 150 155 160 

Val Ala Cys Arg Arg Ala Leu Ser Arg Leu Gly Leu Lys Thr Gin lie 
165 170 175 

Lys Trp Pro Asn Asp Leu Val Val Gly Arg Asp Lys Leu Gly Gly lie 
180 185 190 

Leu lie Glu Thr Val Arg Thr Gly Gly Lys Thr Val Ala Val Val Gly 
195 200 205 

lie Gly lie Asn Phe Val Leu Pro Lys Glu Val Glu Asn Ala Ala Ser 
210 215 220 



Val Gin Ser Leu Phe Gin Thr Ala Ser Arg Arg Gly Asn Ala Asp Ala 
225 230 235 240 
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Ala Val Leu Leu 



Gin Tyr Ala Arg 
260 

Ala Asn Arg Asp 
275 

Thr Val Phe Glu 
290 

His Leu Glu Thr 
305 

Ser Leu Arg Ser 



Ser Glu Arg Phe 
340 

Ala Trp Val Glu 
355 

Arg Asp Leu Ser 
370 

Asn Val Arg lie 
385 

Gin Val Gin Glu 



Ala Gin Ala Leu 
420 

Gly Ser Asp Arg 
435 

Asn Ala Cys Val 
450 

Leu Thr Asp Asp 
465 

His Leu Met Lys 



His Ala Gly Lys 
500 

Ala Ser Gly Met 
515 

Gly Arg Leu Lys 
530 

lie Thr Gly Gly 
545 



Glu Thr Leu Leu 
245 

Asp Gly Phe Ala 



His Gly Lys Ala 
280 

Gly Thr Val Lys 
295 

Ala Glu Gly Lys 
310 

Asp Asp Arg Pro 
325 

Leu Leu Leu Asp 



Asn Gly Thr Phe 
360 

Pro Leu Gly Ala 
375 

Val Gly Cys Ala 
390 

Gin Leu Ala Arg 
405 

Gly lie Arg Asn 



Trp Phe Asn Ala 
440 

Val Val Ser Cys 
455 

Gly His Tyr Leu 
470 

Glu Ser Leu Ala 
485 

Arg Tyr Pro Phe 



Met Asp Ala Val 
520 

Glu Lys Thr Gly 
535 

Gly Ala Ala Lys 
550 



Ala Glu Leu Asp 
250 

Pro Phe Val Ala 
265 

Val Leu Leu Leu 



Gly Val Asp Gly 
300 

Gin Thr Val Val 
315 

Val Ser Val Pro 
330 

Gly Gly Asn Ser 
345 

Ala Thr Val Gly 



Glu Trp Ala Glu 
380 

Val Cys Gly Glu 
395 

Lys He Glu Trp 
410 

His Tyr Arg His 
425 

Leu Gly Ser Arg 



Gly Thr Ala Val 
460 

Gly Gly Thr He 
475 

Val Arg Thr Ala 
490 

Pro Thr Thr Thr 
505 

Cys Gly Ser Val 



Ala Gly Lys Pro 
540 

Val Ala Glu Ala 
555 



Ala Val Leu Leu 
255 

Glu Tyr Gin Ala 
270 

Arg Asp Gly Glu 
285 

Gin Gly Val Leu 



Ser Gly Glu He 
320 



Lys Arg Arg Asp 
335 

Arg Leu Lys Trp 
350 

Ser Ala Pro Tyr 
365 

Lys Val Asp Gly 



Phe Lys Lys Ala 
400 

Leu Pro Ser Ser 
415 

Pro Glu Glu His 
430 

Arg Phe Ser Arg 
445 

Thr Val Asp Ala 



Met Pro Gly Phe 
480 

Asn Leu Asn Arg 
495 

Gly Asn Ala Val 
510 

Met Met Met His 
525 

Val Asp Val lie 



Leu Pro Pro Ala 
560 
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Phe Leu Ala Glu Asn Thr Val Arg Val Ala Asp Asn Leu Val lie His 

565 570 575 

Gly Leu Leu Asn Leu lie Ala Ala Glu Gly Gly Glu Ser Glu His Thr 

580 585 590 



<210> 23 
<211> 753 
<212> DNA 

<213> Clostridium acetobutylicum 



<400> 23 

aataagagag 

gttttagatg 

gctgaatgga 

atgaacttat 

tctgttgtac 

aatccattag 

aaagaagttg 

tctcttataa 

gattatcttg 

aaggcagcta 

actatttcaa 

tttgaaaaat 

gccacaggtg 



cagcttttat 
ttggcaatac 
gactatcaac 
ttcaacaaga 
ctaatatcat 
ttgttggacc 
gagccgacag 
taatagattt 
gtggagcaat 
agcttccaag 
gtatacaatc 
tgaaagaaaa 
gtcttgccaa 



gctgctctta 
taatatagtg 
agatgtatta 

taagctcgat 
gtattcttta 
tggaataaaa 
aattgtaaat 
tggaacagca 
atgccctgga 
agtagagctc 
tggaattgtt 
cctgccggac 
acttattaat 



tttttaagga 
ttaggaatat 
agatctgctg 
ccaacattag 
gaacatatga 
acaggaatta 
gctgtagcag 
actacatttt 
attaaagttt 
ataaaaccag 
tatcgatacc 
ggaaggagaa 
tga 



gtgtattaaa 
acaatgatac 
acgaatatgg 

ttgagggagt 
taagaaagta 
atattaaata 
cacatgaaat 
gtgcagtaag 
catcagaggc 
cgtatgctat 
tacgtcaggt 
caaggacctc 



agtgatttta 
gaaacttaca 
aattcaagta 

aataatatcc 
ctttaagata 
cgataatcct 
ttataaaaga 
agaaaatgga 
tctttttgaa 
ttgtaaaaat 
aaaatactta 
cttggtattg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

753 



<210> 24 
<211> 798 
<212> DNA 

<213> Streptomyces coelicolor 



<400> 24 

atgctgctga 

gacatcgtcg 

gtgctcctcc 

atcgacggca 

acccgccgct 

gtcccgatcc 

gcggccgtgg 

ttcgacgcgg 

atctcggtcg 

ccccgcagcg 

ttcgccggcc 

gacgacgtga 

gtcatcgacg 

aacgtgtcgc 



cgatcgacgt 
agcactggcg 
agggcct cat 
tcgccatctg 
actacggcga 
tcaccgacca 
agctctacgg 
tcagcgcgcg 
aggcgctggg 
tgatcggcaa 
aggtcgacgg 
cggtcatcgc 
agcacgagcc 
gcatgtag 



agggaacacg 
catctccacg 
gggcatgcat 
cgcgacggtc 
cgtccccgcg 
ccccaaggag 
cggcccggcg 
cggggagtac 
cgtcaagggc 
gaacacggtc 
cgtcgtcaac 
gacgggcggg 
gtggctgacg 



cacaccgtcc 
gactcgcgcc 
cccctcctcg 
ccctccgtcc 
gtcctcgtcg 
gtcggcgccg 
atcgtcgtgg 
atcggcggcg 
gcccagctcc 
gaggcgatgc 
cgcatggcgc 
ctggcgccga 
ctgatgggtc 



tcggcctctt 
gcacggccga 
gcgacgaact 
tccacgaact 
aaccgggcgt 
accgcatcat 
acttcggcac 
tcatcgcccc 
gcaagatcga 
agtccggcat 
gggagctggc 
tggtcctggg 
tgcgcctggt 



cgacggcgag 
cgaactggcg 
gggcgacggc 
gcgcgaggtc 
caagaccggc 
caacgcggta 
ggcgacgacg 
cggcatcgag 
ggtggcgcgc 
cgtgtacggc 
cgacgacccg 
cgagtcctcg 
gtacgagcgc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

798 



<210> 25 
<211> 819 
<212> DNA 

<213> Mycobacterium tuberculosis 



<400> 25 

gtgctgctgg 

aaagagcacg 

gacgaactgg 

accgccgcct 

tactggccgt 

ctcgtcgaca 

gaccggttcc 



cgattgacgt 
caaaggtcgt 
cactgacgat 
tgtccacggt 
cggtgccgca 
acccgaagga 
ggaaggccgc 



ccgcaacacc 
gcagcagtgg 
cgacgggctg 
cccgtccgtg 
cgtgctgatc 
agtgggcgca 
catcgtcgtt 



cacaccgttg 
cggatacgca 
atcggcgagg 
ctgcacgagg 
gagcccggag 
gaccgcatcg 
gactttggat 



tgggcctgct 
ccgaatccga 
attccgagcg 
tgcggataat 
tacgcaccgg 
tgaactgttt 
cctcgatctg 



gtccggaatg 60 
agtcaccgcc 120 
gctcaccggt 180 
gctcgaccag 240 
gatccctttg 300 
ggccgcctat 360 
tgttgatgtt 420 
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gtatcggcca 
gatgccgcgg 
gtggttggca 
ctggtagacg 
gatgtcgcga 
gtcgaccatt 
ctcgaagtcc 



agggtgaatt 
cggcccgctc 
agaacaccgt 
ggttggtagg 
tcgtggctac 
acgaccagca 
agcgcggccg 



tcttggcggc 
ggcggcattg 
cgaatgcatg 
ccgcatccgc 
cgggcatacc 
cctgaccttg 
gctcaagacg 



gccatcgcgc 
cgccgcgttg 
caagccggtg 
gaggacgtgt 
gcgcccctgc 
cagggtctgc 
gcgcgctga 



ccggggtgca 
aacttgcccg 
cggtgttcgg 
ccggtttctc 
tgctgccgga 
ggctggtgtt 



ggtgtcttcc 480 
cccacgttcg 540 
cttcgccggg 600 
cgtcgaccac 660 
attgcacacc 720 
cgagcgtaac 780 
819 



<210> 26 
<211> 777 
<212> DNA 

<213> Rhodobacter capsulatus 



<400> 26 

atgcttttgt 
gatttcgccg 
gtctggctga 
atctcctcga 
gactgccgcc 
gatccgggca 
catggcggcg 
cccgatggcg 
ctgcatatgg 
ggcacgaata 
gaaggcatcg 
gggggtcttg 
ctgaccatgc 



gcatcgactg 
ccacctggcg 
acacgctgat 
ccgcgccgcg 
cctatgtcgt 
ccacggtcgg 
atctgatcgt 
cctatatcgg 
cggcggccgc 
cggtggcctg 
tgcggcagat 
cctcgctctt 
atggtctgcg 



cggcaacacc 
catcgccacc 
gcaactgaag 
ggtggtgttc 
cggcaaaccg 
gccggaccgg 
cgtcgatttc 
cggggtgatc 
gctgccgcat 
catccaatcc 
ccggatggaa 
cgatctgggt 
tctgatcttc 



aacaccgtgt 
gatcatcgcc 
ggcctgcagg 
aacctgcgcg 
ggctgcgagc 
ctggtcaata 
ggcaccgcca 
gcgcccgggg 
gtcgacgtca 
ggggtgtatt 
cgtgaccgtc 
ttcgatctgt 
gattacaaca 



tttcggtctg 
gcaccgccga 
gccggatctc 
ttctgtgcaa 
tgccggtggc 
cggtggcggg 
ccacctttga 
tgaacctgag 
cgaaaccgca 
ggggctatat 
cgatgaaggt 
tcgacaaggt 
agggacttgg 



ggacgggacg 
cgaatatttc 
cgaggcgatc 
ccgctatttc 
gccgcgcgtc 
ctatgaccgt 
cgtggtggcc 
ccttgaggcg 
aggggtgatc 
cggccttgtc 
gattgccacc 
cgaggatgac 
ggcgtga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

777 



<210> 27 
<211> 768 
<212> DNA 

<213> Geobacter sulfurreducens 



<400> 27 

gtgcttcttg ttatagacgt gggtaatacc 

cgcctggtga gggattggcg ggtctccacg 

attctcataa atgagttgtt ccgcttggcg 

atcatctcct cggtggtgcc gcccctcacc 

ttcgggatgc gtcccctggt ggtgggaccg 

gacaaccccc gggaggtggg ggccgaccgg 

taccgcacct ctctcattat cgtcgatttc 

cgcaagggag agtactgcgg aggggccatc 

ctgttccagc gggccagcaa gctgcccagg 

gccaggaaca cggtcaattc gatgcaggcg 

gacgagatcg tcacccggat gaaggccgag 

ggagggttgg cgtccctcat agcgccggag 

ctgacactgg agggattgcg catactgtac 



aatatcgtgc tcgggattta cgatggcgag 60 

gacaaggccc gtactaccga cgagtacggt 3 20 

ggccttgggc tcgatcagat ccgcgcggtg 180 

ggcgtgctgg agcgtctttc cctggggtat 24 0 

ggcatcaaga caggcatgcc aatccagtac 300 

atcgtgaacg cggtggcggg gtacgagaag 360 

ggcaccgcta ccacgttcga ctacgtgaac 420 

gcgccgggac tcgtcatttc caccgaggcc 480 

gttgatatca tacgtccgtc cgcgatcatt 540 

ggaatttact atggttacgt ggggctcgta 600 

agcaaggatg cgccccgggt tatcgctacc 660 

tccaagacca tcgaagccgt cgaggaatat 720 

gaacgaaaca gggagtga 7 68 



<210> 28 
<211> 789 
<212> DNA 

<213> Deinococcus radiodurans 



<400> 28 

gtgcccgctt ttcccctgct cgccgtggac 
gccgacgcct cgggcgccct gacccacacc 
cccgacgacc tcgcgctgca actgcacggg 
cgcgccgccg tgctgagcag cgtggcgccc 
aagcggcact tcatgatcga cgcttttgcc 
gtggaactcg acacgccggg ctcggtgggt 
gaaaagtacc tgggggggct ggactacgcg 



atcggcaaca ccaccaccgt cctgggtctg 60 

tggcggattc ggaccaaccg cgagatgctg 120 

ctctttaccc tcgccggggc gccgattccc 180 

ccggtgggcg aaaactacgc gctcgcgctc 240 

gtgagtgccg agaacctgcc cgacgtgacg 300 

gcggaccgcc tgtgcaacct cttcggcgcc 360 

gtggtagtgg atttcgggac ctccaccaac 420 
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tttgacgtgg 
gtcagcgccg 
cccgagacgg 
tacgccgaga 
gtcgccgtcg 
tactacgacg 
gtccgctga 



tggggcgggg 
acgccctgtt 
ccatcggcaa 
tggtggacgg 
ccactggcgg 
aaacgctgac 



gcggcgtttc 
cgcccgcgcc 
aaacaccgtc 
cctgctgcgc 
cttctcgcgc 
gttgcgcggg 



ctcggcggca 
gccaaactgc 
cacgcgctgc 
cgcatccgcg 
accgtgcagg 
ttggtggagc 



tcctcgccac 
cgcgcatcac 
aatcgggcct 
ccgagttgcc 
ggatttgcca 
tgtgggcgag 



cggagcgcag 480 
cctgcaagcg 540 
ggtcttcggc 600 
gggcgaagcg 660 
ggaaatcgac 720 
ccgttcggag 7 80 
789 



<210> 29 
<211> 741 
<212> DNA 

<213> Thermotoga maritima 



<400> 29 

ttgtacctcc 

aaaactttca 

tcacaccttc 

tccgtcgttc 

tcaccgatat 

gaagtgggtg 

ggaatcatca 

gaaggaggag 

gcaaaacttc 

gaaaacatca 

cgaataaagg 

gtgaaagata 

catttctgct 



tcgtggacgt 
gaaggtggag 
atcctcttct 
ccactcagaa 
gggtgaaggc 
ctgatagggt 
tcgacatggg 
ccattttgcc 
cgctcgttga 
ggctgggtgt 
aagtttacgg 
tgataaaaca 
tcggagattg 



gggtaacacg 
gctgtccacc 
gggcgatgct 
cacagtcata 
gaaaaacgga 
ggccaacgtt 
aacggcaacc 
tggattcttc 
ggtaaaacca 
ggtgaacgga 
tgatttaccg 
cgagattttc 
a 



cattctgtct 
ggtgtgtttc 
atgcgtgaga 
gagcgttttt 
tgtgtgaaat 
gtcgctttcg 
accgtggatc 
atgatggt u C 
gcggattttg 
agtgtctacg 
gtggttctca 
gatgaggacc 



tctctatcac 
agacggaaga 
taaaggggat 
ctcaaaagta 
ggaacgtgaa 
tcaaggaata 
ttgttgtgaa 
actcgctctt 
ttgtaggaaa 
ctcttgaggg 
cgggaggtca 
tcacgatcaa 



cgaagatggt 
cgaactcttt 
aggagtggcc 
tttccacata 
gaatccctcg 
cggtaaaaac 
cggatcttac 
tcggggaacg 
ggatacggag 
gataataggg 
gtcgaagatc 
gggggtgtac 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

741 



<210> 30 
<211> 822 
<212> DNA 

<213> Treponema pallidum 



<400> 30 

atgcttttga 

ggccgtgtgt 

gagtactcgc 

cgtgatgcgt 

gctcagatta 

gtgcgcatac 

gcggcctatg 

tttacggcgg 

actgcggtgc 

cctgattccg 

acgctctttg 

gcagcggtga 

atcgatgcac 

ctcctgccac 



tagacgtagg 
gcgtgcgtga 
ttctcatcca 
ttatttcctc 
gcggcgtcca 
cagagccagt 
tgcatttccg 
tggatggcac 
agtctctcca 
ttctgggcaa 
ttattcgcgc 
taacgggggg 
agctgacgct 
ctgctacagt 



gaactcgcac 

gttgtttcgc 
tgcgct ttgc 
cgtcgtgcct 
gccggttgtc 
gcgcgcggaa 
ttctgcttgc 
ggggttgatt 
tacaggaacg 
ggatactacg 
tatgattgca 
gctttcgcgt 
ctcaggtctt 
gtcaggttca 



gtagtgttcg 
cttgcgcctg 
gaacgtgcgg 
gtgttgacaa 
tttggcccgt 
attggcactg 
gtggtagtgg 
caaggggtgg 
gcacaattac 
catgcggtgc 
cagtgtcaga 
cttttctcgt 
gcacatattg 
tcggggaatt 



gaatccaagg 
acgcgcgtaa 
gggtcggccg 
agaccattgc 
gggcgtacga 
acttggtagc 
attgtggaac 
caattgcgcc 
cacttgttcc 
aggcgggtgt 
aagagttagg 
cagaggtgga 
cgcggctggt 
ga 



cgagaatggt 60 
aacccaagat 120 
tgcttctctc 180 
agatgcggtc 240 
gcacttgccg 300 
caacgcggtg 360 
agcgctcacc 420 
tggtctgcgc 480 
tcttgccctg 540 
ggtgcggggc 600 
gtgccgctgt 660 
ctttcctcct 720 
gccgacatct 780 
822 



<210> 31 
<211> 789 
<212> DNA 

<213> Borrelia burgdorferi 
<400> 31 

atgaataaac ctttattatc agaattgata 
gccttattta aagataatca agttaattta 
aggtatgatg aggtttatag cttttttgaa 
tttataagca gcgttgttcc tattcttaat 
tttaagataa agcctttgtt tattggtttt 



attgatattg gaaataccag cattgctttt 60 
tttattaaaa tgaaaacaaa tcttatgtta 120 
gaaaattttg attttaatgt aaataaagtt 180 
gaaacattta aaaatgtcat tttttctttt 240 
gatttgaatt atgatttgac atttaatcct 300 
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tacaaaagcg 
gaaaattatt 
gctgttagca 
tttaattctt 
aatcttttag 
tatttaatag 
ataattactg 
aatattcatt 
gttaattga 



ataaattttt 
catttgaaaa 
ggcaagatgg 
tattagataa 
agagaacgac 
aaggtgttta 
ggggtaatgc 
taactgtaga 



gctaggttca 
tgttttagta 
aatactcggt 
tgcctatctt 
atctgggagt 
tcgtgatatt 
ggacctaatt 
aggcgttaga 



gacgtttttg 
gtagaccttg 
ggtattataa 
atcaaaaaat 
gtaaacagcg 
aagcagatgt 
ttgtcattaa 
attttaggaa 



ccaatcttgt 
gaactgcttg 
attctggtcc 
tccccattag 
gtttatttta 
ataaaaaaaa 
ttgagataga 
attctattga 



tgcagccatt 360 
caccattttt 420 
tttgataaat 480 
cactccaaat 540 
tcaatataag 600 
atttaattta 660 
gtttattttt 720 
ctttaagttt 780 
789 



<210> 32 
<211> 690 
<212> DNA 

<213> Aquifex aeolicus 



<400> 32 

atgaggtttt 

aagaaagtaa 

aaattaaaag 

ataccgaaga 

cctgaaacgc 

aagaatgttg 

aaatttaagg 

ctggcggagg 

acacgagagt 

ctgaaactct 

gggaagtact 

aatttacttt 



tgacggtaga 
aagattttct 
cgctcggaat 
taaagttttt 
tgggcacgga 
tagtaa tcag 
gagggtttat 
gaattcccga 
gcgtcctggg 
ggagaaaagt 
tttccaagtt 
acctctatca 



cgtagggaat 
gaaactttca 
atccgtaaaa 
aaagaaggaa 
cagggtagca 
tgcgggtact 
taccttagga 
gttttttccc 
aggggcttac 
atttaaaaga 
cggtatttac 
caggatttaa 



tcctccgttg 
cacgaagaat 
cagagtttta 
aactttccta 
cttgcttact 
gcccttgtaa 
cttggaaaga 

gaagaggtag 
agggagagca 
aagttcaaag 
gacccactcc 



atatcgccct 
ttttaaagga 
gcgaaaaagt 
tacaggttga 
ccgccaaaaa 
ttgacctagt 
agttaaaaat 
aaatttttct 
cagaatttat 
tggttataac 
ttgttcacag 



atgggaaggg 
agaatttcct 
aaggggaaaa 
ttacaaaact 
gttttacgga 
tcttgagggc 
tctttccgac 
tgggcgttct 
taaaagtaca 
gggcggagag 
gggcatgaga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

690 



<210> 33 
<211> 774 
<212> DNA 

<213> Synechocystis sp. 



<400> 33 

gtggaaacat 

ttaggcctaa 

ctgcaaacct 

aaagttcctc 

cagcctaaaa 

att gaccggg 

gtt gatggag 

ggagcgatct 

ctaccgaagt 

cccagcgcca 

caggattggc 
atattacatg 
ttgatcttcc 



caaagccggg 
tgataggcaa 
gggttacaga 
tgatgttggc 
ttttgaccct 
ccctggctgg 
gcactgcttt 
tgcccggttt 
tagaaatgga 
tttttagtgg 

aaaagctttt 
gcttcctaaa 
tcggtatggc 



ttgtggttta 
ctcccgtctg 
ttacaacccc 
atcggtggta 
gaagaatctt 
tttagggacg 
gaccattaca 
gggattgcag 
tcaattaacc 
tgttgtctat 
tcctggtgcc 
agagcattct 
ggccatacac 



gccctggata 
cactgggcat 
aagtcagctc 
ccggaacaaa 
cccctggtca 
gggctgacct 
gggtttgacc 
ttagcaaccc 
gagttgcctg 
ggcgtgttgg 

gccatggtta 
cctaatcttt 
cacggcgatc 



atgacaagca 
attgtagcgg 
agttgccggt 
ccgaagtttg 
atctttaccc 
acggctttcc 
aagataaaaa 
ttggcgatcg 
accgttgggc 
gggcattgca 
tcaccggggg 
cggtggcctg 
gccccatctg 



aaaaccttgg 
caatgctccc 
tttgttgggg 
gcgagtatat 
cagctttggc 
ctgtctagtg 
actggtgggg 
cctggcggcc 
tttagatacc 
gagttatctc 

agacggcaag 
ggatgacaat 
ttag 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

774 



<210> 34 
<211> 672 
<212> DNA 

<213> Helicobacter pylori 
<400> 34 

atgccagcta ggcaatcttt caaggattta 
acacgcatcc atttcgcgca aaactaccag 
cgtttgggta ttcaaaagga aattttttac 
cttttaaatt gttaccctaa cgctaaaaat 
tatatagggc ttgggataga ccggcaaatg 



aaagacttga ttttatgcga tataggcaac 60 
ctcttttcaa gcgctaaaga agatttaaag 120 
attagtgtga atgaagaaaa tgaaaaagct 180 
atcgcagggt tttttcattt agaaaccgac 240 
gcatgtttag cggtggttaa tggggttata 300 
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gtggatgctg ggagcgcgat tacgattgat ttagtcaaag agggcaagca tttaggaggg 360 
tgtattttgc ccggtttagc ccaatatgtc catgcgtata aaaaaagcgc gaaaatctta 420 
gagcaacctt tcaaagcctt agattcttta gaagttttac ccaaaaacac cagagacgct 480 
gtgaattacg gcatgatttt gagtatcatc tcttgtatcc aacatttagc taaagatcaa 540 
aaaatctatc tttgtggggg cgatgcgaaa tatttgagcg cgtttttacc tcattctgtt 600 
tgcaaggagc gtttggtttt tgacgggatg gaaatcgctc ttaaaaaagc agggatacta 660 
gaatgcaaat ga 672 



<210> 35 
<211> 747 
<212> DNA 

<213> Pseudomonas aeruginosa 



<400> 35 

atgattcttg agctcgactg tggaaactcg ctgatcaagt ggcgggtcat cgagggggcg 60 

gcgcggtcgg tcgccggtgg ccttgcggag tccgatgatg ccctggtcga acagttaacg 120 

tcgcagcaag cgctgccagt gcgagcctgt cgcctggtga gcgttcgcag cgagcaggaa 180 

acctcgcaac tggtcgcacg gttggagcag ctgttcccgg tttcggcgct ggttgcatca 240 

tccggcaagc agttggcggg tgtgcgcaac ggctatcnzcg attaccagcg cctggggctc 300 

gaccgctggc tggccctcgt cgcggctcat cacctggcta agaaggcctg cctggtcatt 360 

gatctgggga ccgcggtcac ctctgacctg gtcgcggcgg atggagtgca tctggggggc 420 

tacatatgcc cgggcatgac cctgatgaga agccagttgc gcacccatac ccgacgtatc 480 

cgctacgacg atgcagaggc ccggcgggcg cttgccagtc tccagccagg gcaggccacg 540 

gccgaggcgg ttgagcgggg ttgtctgctc atgctcaggg ggttcgttcg tgagcagtac 600 

gccatggcgt gcgagctgct cggtccggat tgtgaaatat tcctgacggg tggggatgcc 660 

gaactggttc gcgacgaact ggctggcgcc cggatcatgc cggacctggt tttcgtaggg 720 
ctggcactgg cttgcccgat tgagtga 747 



<210> 36 
<211> 630 
<212> DNA 

<213> Campylobacter jejuni 



<400> 36 

atgttgctct 

act cttaata 

gtcaatgaac 

tattttttat 

actattgaag 

aattctatac 

agccatattt 

ccacaaaaaa 

aaagatgccg 

aattatttcg 

ataaaagaaa 



gtgatattgg 
tagatcagtt 
atctcaaaga 
ttgatacaat 
atggagttgt 
atcttggtgg 
caccacgatt 
ccatggatgc 
ctcaaaataa 
atcacgcaat 
atcccaattt 



gaattcaaat 
tttagaattt 
acatttaaaa 
ttatcaagga 
tgtagatgca 
ttttatcttg 
aaaaagtgaa 
tttaagttat 
aaagctttat 
ttatgataaa 
actttattaa 



gctaatttcc 
aaaaatgaac 
aatcaaaaaa 
ttaggaatcg 
ggtagtgcta 
ccaggtattg 
tttaacactc 
ggtgttttta 
ttcactggtg 
cttttaatct 



tagatgataa 
aaaaaatttt 
attttatcaa 
atcgcatagc 
ttacaattga 
caaattataa 
aagttagtct 
aaggaattta 
gagatgggca 
ttcgaggaat 



caaatatttt 
ttatatcaat 
tcttgaacct 
agcttgttat 
tattatttct 
aaaaatttat 
tgatgcattc 
cctactgata 
atttttagca 
gaaaaagatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

630 



<210> 37 
<211> 1779 
<212> DNA 

<213> Neisseria meningitidis 



<400> 37 

atgacggttt tgaagccttc gcactggcgg 
caacacgtct cgcaactggc gcgtatggcg 
tggcagcaga tgccggcgca catacgcggg 
ctggtgcgcc cattggcggt tttcgatgcc 
ggttttcaga cggcattgaa gcacgagtgc 
gcgcggattg cgccggacaa ggcgcacaaa 
ggcagggggc ggcaggggcg gaagtggtcg 
tttggctggg tgtttgaccg gccgcagtat 
gtggcgtgcc ggcgcgcctt gtcgcgtttg 



gtgttggcgg agcttgccga cggtttgccg 60 
gatatgaagc cgcagcagct caacggtttt 120 
ctgttgcgcc aacacgacgg ctattggcgg 180 
gaaggtttgc gcgagctggg ggaaaggtcg 240 
gcgtccagca acgacgagat actggaattg 300 
accatatgtg tgacccacct gcaaagtaag 360 
caccgtttgg gcgagtgtct gatgttcagt 420 
gagttgggtt cgctgtcgcc tgttgcggca 480 
ggtttgaaaa cgcaaatcaa gtggccaaac 540 
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gatttggtcg 
ggcaaaacgg 
aacgccgctt 
gccgtgttgc 
gacggatttg 
gtattgctgt 
caaggcgttc 
agcctgcggt 
ctgctgttgg 
gcaaccgtcg 
aaggtggatg 
caagtgcagg 
ggcatacgca 
tt gggcagcc 

acggttgacg 
cacctgatga 
cgttatcctt 
tgcggctcgg 
gtcgatgtca 
tttttggcgg 
ctgattgccg 



tcggacgcga 
ttgccgtggt 
ccgtgcaatc 
tggaaacgct 
cgccttttgt 
tgcgcgacgg 
tgcacttgga 
ccgacgacag 
acggcggcaa 
gtagcgcgcc 
gaaatgtccg 
aacagctcgc 
accactaccg 
gccgcttcag 
cgctcaccga 
aagaatcgct 
tcccgaccac 
ttatgatgat 
tcattaccgg 
aaaataccgt 
ccgaaggcgg 



caaattgggc 
cggtatcggc 
gctgtttcag 
gttggcggaa 
ggcggaatat 
cgaaaccgtg 
aacggcagag 
gccggtttcc 
cagccggctc 
gtaccgcgat 
catcgtcggt 
ccgaaaaatc 
ccaccccgaa 
ccgcaacgcc 

tgacggacat 
cgccgtccga 
aacgggcaat 
gcacgggcgt 
cggcggcgcg 
gcgcgtggcg 
ggaatcggaa 



ggcattctga 
atcaatttcg 
acggcatcgc 
cttgatgcgg 
caggctgcca 
ttcgaaggca 
ggcaaacaga 
gtgccgaagc 
aagtgggcgt 
ttgtcgcctt 
tgcgccgtgt 
gagtggctgc 
gaacacggtt 
tgcgtcgtcg 

tatctcgggg 
accgccaacc 
gccgtcgcca 
ttgaaagaaa 
gcaaaagttg 
gacaacctcg 
catacttaa 



ttgaaacggt 
tgctgcccaa 
ggcggggaaa 
tgttgttgca 
accgcgacca 
cggttaaagg 
cggtcgtcag 
ggcgggattc 
gggtggaaaa 
tgggcgcgga 
gcggagaatt 
cgtcttccgc 
ccgaccgctg 
tcagttgcgg 

gaaccatcat 
tcaaccggca 
gcggcatgat 
aaaccggggc 
ccgaagccct 
tcattcacgg 



caggacgggc 
ggaagtggaa 
tgccgatgcc 
atatgcgcgg 
cggcaaggcg 
cgtggacgga 
cggcgaaatc 
ggaacgtttt 
cggcacgttc 
gtgggcggaa 
caaaaaggca 
acaggctttg 
gttcaacgcc 
cacggcggta 
gcccggtttc 
cgccggtaag 
ggatgcggtt 
gggcaagcct 
gccgcctgca 
gctgctgaac 



600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1779 



<210> 38 
<211> 804 
<212> DNA 

<213> Bordetella pertussis 



<400> 38 

atgattatcc tcatcgactc 
gcgccgcagg cggcgcgcga 
ctgggccgct ggctggccac 
gccgggcttg cccgcggcga 
cggtggctgc gggcccagcc 
caactgggcg ccgaccgctg 
cacccgccgc tgctggtggc 
gacaatgtct ttcccggcgg 
gcctacggca ccgcccacct 
acccatcagg ccatcgccag 
tggctggccg gccgccaacg 
tggcccgaag tgcggcagga 
gccacgccgc agcccactta 
gcgcaaggcg cgccaacggc 



cggcaacagc cgcctcaaag 
gcccgccccc gtcgccttcg 
cctgcccagg cgcccgcaac 
agccattgcc gccacgctgc 
cctggccatg gggctgcgca 
ggcgtgcatg gtgggcgtgc 
cagtttcggc acggccacca 
gctgatcctg cccggccccg 
gcccctggcc gacggcctgg 
cggcatcgcc gccgcccagg 
ctacggccag gcgccggaga 
agccgagcgc ctgctggcgg 
cctcgacagc cccgtgctcg 
ctga 



tcggctggtt tgacccggac 60 
acaatctcga cctggacgcg 120 
gggcgctggg cgtgaacgtc 180 
gcgcgggcgg ttgcgacatc 24C 
acggctatcg caatcccgac 300 
tggcgcgcca gccgtccgtg 360 
cgctggacac catcgggccc 420 
ccatgatgcg cggcgcgctg 480 
tggccgacta ccccatcgac 540 
ccggcgcgat cgtgcggcaa 600 
tctatgtcgc cggcggcggg 660 
tcaccggcgc cgccttcggc 720 
acggcctggc ggcgctcgcc 780 

804 



<210> 39 
<211> 460 
<212> PRT 
<213> Neisseria 

<400> 39 

Met Gly Glu Cys 

1 

Gin Tyr Glu Leu 
20 

Arg Ala Leu Gly 
35 

Asp Leu Val Val 



gonorrhoeae 

Leu Met Phe Ser 
5 

Gly Ser Leu Ser 

Cys Leu Gly Leu 
40 

Gly Arg Asp Lys 



Phe Gly Trp Ala 
10 

Pro Val Ala Ala 
25 

Glu Thr Gin lie 



Leu Gly Gly lie 



Phe Asp Arg Pro 
15 

Leu Ala Cys Arg 
30 

Lys Trp Pro Asn 
45 

Leu lie Glu Thr 
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50 55 60 

Val Arg Ala Gly Gly Lys Thr Val Ala Val Val Gly lie Gly lie Asn 
65 70 75 80 

Phe Val Leu Pro Lys Glu Val Glu Asn Ala Ala Ser Val Gin Ser Leu 
85 90 95 

Phe Gin Thr Ala Ser Arg Arg Gly Asn Ala Asp Ala Ala Val Leu Leu 
100 105 110 

Glu Thr Leu Leu Ala Glu Leu Gly Ala Val Leu Glu Gin Tyr Ala Glu 
115 120 125 

Glu Gly Phe Ala Pro Phe Leu Asn Glu Tyr Glu Thr Ala Asn Arg Asp 
130 135 140 

His Gly Lys Ala Val Leu Leu Leu Arg Asp Gly Glu Thr Val Cys Glu 
145 150 155 160 

Gly Thr Val Lys Gly Val Asp Gly Arg Gly Val Leu His Leu Glu Thr 
165 170 175 

Ala Glu Gly Glu Gin Thr Val Val Ser Gly Glu lie Ser Leu Arg Pro 
180 185 190 

Asp Asn Arg Ser Val Ser Val Pro Lys Arg Pro Asp Ser Glu Arg Phe 
195 200 205 

Leu Leu Leu Glu Gly Gly Asn Ser Arg Leu Lys Trp Ala Trp Val Glu 
210 215 220 

Asn Gly Thr Phe Ala Thr Val Gly Ser Ala Pro Tyr Arg Asp Leu Ser 
225 230 235 240 

Pro Leu Gly Ala Glu Trp Ala Glu Lys Ala Asp Gly Asn Val Arg He 
245 250 255 

Val Gly Cys Ala Val Cys Gly Glu Ser Lys Lys Ala Gin Val Lys Glu 
260 265 270 

Gin Leu Ala Arg Lys He Glu Trp Leu Pro Ser Ser Ala Gin Ala Leu 
275 280 285 

Gly He Arg Asn His Tyr Arg His Pro Glu Glu His Gly Ser Asp Arg 
290 295 300 

Trp Phe Asn Ala Leu Gly Ser Arg Arg Phe Ser Arg Asn Ala Cys Val 
305 310 315 320 

Val Val Ser Cys Gly Thr Ala Val Thr Val Asp Ala Leu Thr Asp Asp 
325 330 335 

Gly His Tyr Leu Gly Gly Thr He Met Pro Gly Phe His Leu Met Lys 
340 345 350 

Glu Ser Leu Ala Val Arg Thr Ala Asn Leu Asn Arg Pro Ala Gly Lys 
355 360 365 



Arg Tyr Pro Phe Pro Thr Thr Thr Gly Asn Ala Val Ala Ser Gly Met 
370 375 380 



OGZ-001 



-29- 



Met Asp Ala Val 
385 

Glu Lys Asn Gly 



Gly Ala Ala Lys 

420 

Asn Thr Val Arg 
435 

Leu lie Ala Ala 
450 



Cys Gly Ser lie 
390 

Ala Gly Lys Pro 
405 

Val Ala Glu Ala 



Val Ala Asp Asn 
440 

Glu Gly Gly Glu 
455 



Met Met Met His 
395 

Val Asp Val lie 
410 

Leu Pro Pro Ala 
425 

Leu Val He His 



Ser Glu His Ala 
460 



Gly Arg Leu Lys 
400 

lie Thr Gly Gly 
415 

Phe Leu Ala Glu 
430 

Gly Leu Leu Asn 
445 



<210> 40 
<211> 1383 
<212> DNA 

<213> Neisseria gonorrhoeae 



<400> 40 

ttgggcgagt gcctgatgtt cagtttcgga 
ggttcgctgt cgcctgttgc ggcacttgcg 
gaaacgcaaa tcaagtggcc aaacgatttg 
ctgattgaaa cagtcagggc gggcggtaaa 
ttcgtgctgc ccaaggaagt ggaaaacgcc 
tcgcggcggg gcaatgccga tgccgccgta 
gcggrgttgg aacaatatgc ggaagaaggg 
gccaaccgcg accacggcaa ggcggtattg 
ggcacggtta aaggcgtgga cggacgaggc 
cagacggtcg tcagcggcga aatcagcctg 
aagcggccgg attcggaacg ttttttgctg 
gcgtgggtgg aaaacggcac gttcgcaacc 
cctttgggcg oggagtgggc ggaaaaggcg 
gtgtgcggag aatccaaaaa ggcacaagtg 
ctgccgtctt ccgcacaggc tttgggcata 
ggttccgacc gttggttcaa cgccttgggc 
gtcgtcagtt gcggcacggc ggtaacggtt 
ggcggaacca tcatgcccgg cttccacctg 
aacctcaacc gccccgccgg caaacgttac 
gcaagcggca tgatggacgc ggtttgcggc 
gaaaaaaacg gcgcgggcaa gcctgtcgat 
gtcgccgaag ccctgccgcc tgcatttttg 
ctcgtcatcc acgggctgct gaacctgatt 
taa 



tgggcgtttg accgcccgca gtatgagttg 60 
tgccggcgcg ctttggggtg tttgggtttg 120 
gtcgtcggac gcgacaaatt gggcggcatt 180 
acggttgccg tggtcggtat cggcatcaat 240 
gcttccgtgc agtcgctgtt tcagacggca 300 
ttgctggaaa cattgcttgc ggaactgggc 360 
ttcgcgccat ttttaaatga gtatgaaacg 420 
ctgttgcgcg acggcgaaac cgtgtgcgaa 480 
gttctgcact tggaaacggc agaaggcgaa 54 0 
cggcccgaca acaggtcggt ttccgtgccg 600 
ttggaaggcg ggaacagccg gctcaagtgg 660 
gtgggcagcg cgccgtaccg cgatttgtcg 720 
gatggaaatg tccgcatcgt cggttgcgcc 780 
aaggaacagc tcgcccgaaa aatcgagtgg 840 
cgcaaccact accgccaccc cgaagaacac 900 
agccgccgct tcagccgcaa cgcctgcgtc 960 
gacgcgctca ccgatgacgg acattatctc 1020 
atgaaagaat cgctcgccgt ccgaaccgcc 1080 
cctttcccga ccacaacggg caacgccgtc 3 140 
tcgataatga tgatgcacgg ccgtttgaaa 1200 
gtcatcatta ccggcggcgg cgcggcgaaa 1260 
gcggaaaata ccgtgcgcgt ggcggacaac 1320 
gccgccgaag gcggggaatc ggaacacgct 1380 

1383 



<210> 41 
<211> 244 
<212> PRT 

<213> Porphyromonas gingivalis 



<400> 41 

Met Ser Phe Asn Leu lie Val Asp Gin Gly Asn Ser Ala Cys Lys Val 

15 10 15 

Ala Phe Val Arg Asn Asn Ser lie Glu Ser lie Ser Phe Leu Pro Gly 

20 25 30 

Lys Ala Gly Gin Ala Leu Ser His Leu Val Ala Pro His Arg Phe Asp 
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35 

Lys Ala lie Tyr 
50 

He Val Arg Ser 
65 

Pro Val Pro Leu 



Arg Leu Ala Ala 
100 

Leu Leu Val He 
115 

Ala Glu Gly He 
130 

Arg Phe Lys Ala 
145 

Pro Ser Gly He 



40 

Ser Ser Val Gly 
55 

Cys Ala Ala Ala 
70 

Arg Leu Gin Tyr 
85 

Val Val Gly Ala 



Asp Ala Gly Thr 
120 

Tyr Leu Gly Gly 
135 

Leu His Leu Phe 
150 

Ser Pro Lys He 
165 



Leu Pro Asp Glu 
60 

Ser Leu Met Met 
75 

Asp Arg Arg Thr 
90 

His Ser Leu Tyr 
105 

Ala He Thr Tyr 



Asn He Ser Pro 
140 

Thr Gly Arg Leu 
155 

Ala Glu Tyr Gly 
170 



45 

Glu Ala Glu Ala 



Gly Thr Glu Thr 
80 

Leu Gly Ala Asp 
95 

Pro Asn Thr Glu 
110 

Glu Arg Val Ser 
125 

Gly Leu His Leu 



Pro Leu He Asp 
160 

Ser Ser Thr Glu 
175 



Glu Ala He Thr Ala Gly Val He 
180 

Arg Tyr He Asp Asp Leu His Ala 
195 200 

Leu Thr Gly Gly Asp Ala Asn Tyr 
210 215 

He Leu He His Pro Asp Leu Val 
225 230 



His Gly Leu Ala Gly Glu He Asp 
185 190 

Lys Glu Gly Arg Ser Ala Val He 
205 

Leu Ala Arg He He Arg Ser Gly 
220 

Leu Leu Gly Leu Asn Arg He Leu 

235 240 



Glu Tyr Asn Val 



<210> 42 
<211> 735 
<212> DNA 

<213> Porphyromonas gingivalis 



<400> 42 

atgtccttca 

aataatagta 

ctcgtcgctc 

gaggctgaag 

cccgtacccc 

gtggtcggag 

gccatcactt 

ggtctccact 

ccttccggta 

gccggagtaa 
aaagaggggc 
ataagaagcg 
gaatataatg 



atctgatcgt 
tagagagcat 
ctcaccgttt 
ctattgtgag 
ttcgcctgca 
cgcatagtct 
atgaacgagt 
tgcgcttcaa 
tctctccgaa 

ttcatggcct 
ggtctgccgt 
gaatactaat 
tataa 



cgatcaaggc 
ttcctttctg 
cgacaaggct 
aagttgtgca 
atatgatcgc 
ctatccgaat 
atccgctgaa 
ggctcttcat 
aatagccgag 

ggcaggggag 
tatactgacc 
tcatcccgat 



aattctgcct 
ccgggaaaag 
atctactcat 
gctgcttcct 
cgcactttgg 
accgaattgc 
gggatctatc 
ctctttacgg 
tatggctcct 

atagacagat 
ggaggagatg 
ttagtacttt 



gtaaggttgc 
ccggacaggc 
ctgtggggct 
tgatgatggg 
gtgccgaccg 
tggtgatcga 
tcggtggcaa 
gcaggctccc 
cgaccgaaga 

atattgacga 
ccaactattt 
tgggcctaaa 



tttcgtccga 
actcagccat 
tcccgacgaa 
gactgagacc 
actggctgcg 
cgccggtact 
tatatcgccc 
cctgattgat 
agcgatcaca 

tctgcacgct 
ggcacggatt 
tagaatttta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

735 
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<210> 43 
<211> 592 
<212> PRT 

<213> Neisseria meningitidis 
<400> 43 

Met Thr Val Leu Lys Leu Ser His Trp Arg Val Leu Ala Glu Leu Ala 
15 10 15 

Asp Gly Leu Pro Gin His Val Ser Gin Leu Ala Arg Met Ala Asp Met 
20 25 30 

Lys Pro Gin Gin Leu Asn Gly Phe Trp Gin Gin Met Pro Ala His lie 
35 40 45 

Arg Gly Leu Leu Arg Gin His Asp Gly Tyr Trp Arg Leu Val Arg Pro 
50 55 60 

Leu Ala Val Phe Asp Ala Glu Gly Leu Arg Glu Leu Gly Glu Arg Ser 
65 70 75 80 

Gly Phe Gin Thr Ala Leu Lys His Glu Cys Ala Ser Ser Asn Asp Glu 
85 90 95 

lie Leu Glu Leu Ala Arg lie Ala Pro Asp Lys Ala His Lys Thr lie 
100 105 110 

Cys Val Thr His Leu Gin Ser Lys Gly Arg Gly Arg Gin Gly Arg Lys 
115 120 125 

Trp Ser His Arg Leu Gly Glu Cys Leu Met Phe Ser Phe Gly Trp Val 
130 135 140 

Phe Asp Arg Pro Gin Tyr Glu Leu Gly Ser Leu Ser Pro Val Ala Ala 
145 150 155 160 

Val Ala Cvs Arg Arg Ala Leu Ser Arg Leu Gly Leu Asp Val Gin lie 
165 170 175 

Lys Trp Pro Asn Asp Leu Val Val Gly Arg Asp Lys Leu Gly Gly lie 
180 185 190 

Leu lie Glu Thr Val Arg Thr Gly Gly Lys Thr Val Ala Val Val Gly 
195 200 205 

He Gly He Asn Phe Val Leu Pro Lys Glu Val Glu Asn Ala Ala Ser 
210 215 220 

Val Gin Ser Leu Phe Gin Thr Ala Ser Arg Arg Gly Asn Ala Asp Ala 
225 230 235 240 

Ala Val Leu Leu Glu Thr Leu Leu Val Glu Leu Asp Ala Val Leu Leu 
245 250 255 

Gin Tyr Ala Arg Asp Gly Phe Ala Pro Phe Val Ala Glu Tyr Gin Ala 
260 265 270 



Ala Asn Arg Asp His Gly Lys Ala Val Leu Leu Leu Arg Asp Gly Glu 
275 280 285 
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Thr Val Phe Glu Gly Thr Val Lys Gly Val Asp Gly Gin Gly Val Leu 
290 295 300 

His Leu Glu Thr Ala Glu Gly Lys Gin Thr Val Val Ser Gly Glu lie 
305 310 315 320 

Ser Leu Arg Ser Asp Asp Arg Pro Val Ser Val Pro Lys Arg Arg Asp 
325 330 335 

Ser Glu Arg Phe Leu Leu Leu Asp Gly Gly Asn Ser Arg Leu Lys Trp 
340 345 350 

Ala Trp Val Glu Asn Gly Thr Phe Ala Thr Val Gly Ser Ala Pro Tyr 
355 360 365 

Arg Asp Leu Ser Pro Leu Gly Ala Glu Trp Ala Glu Lys Ala Asp Gly 
370 375 380 

Asn Val Arg He Val Gly Cys Ala Val Cys Gly Glu Phe Lys Lys Ala 
385 390 395 400 

Gin Val Gin Glu Gin Leu Ala Arg Lys lie Glu Trp Leu Pro Ser Ser 
405 410 415 

Ala Gin Ala Leu Gly He Arg Asn His Tyr Arg His Pro Glu Glu His 
420 425 430 

Gly Ser Asp Arg Trp Phe Asn Ala Leu Gly Ser Arg Arg Phe Ser Arg 
435 440 445 

Asn Ala Cys Val Val Val Ser Cys Gly Thr Ala Val Thr Val Asp Ala 
450 455 460 

Leu Thr Asp Asp Gly His Tyr Leu Gly Gly Thr He Met Pro Gly Phe 
465 470 475 480 

His Leu Met Lys Glu Ser Leu Ala Val Arg Thr Ala Asn Leu Asn Arg 
485 490 495 

His Ala Gly Lys Arg Tyr Pro Phe Pro Thr Thr Thr Gly Asn Ala Val 
500 505 510 

Ala Ser Gly Met Met Asp Ala Val Cys Gly Ser Val Met Met Met His 
515 520 525 

Gly Arg Leu Lys Glu Lys Thr Gly Ala Gly Lys Pro Val Asp Val He 
530 535 540 

He Thr Gly Gly Gly Ala Ala Lys Val Ala Glu Ala Leu Pro Pro Ala 
545 550 555 560 

Phe Leu Ala Glu Asn Thr Val Arg Val Ala Asp Asn Leu Val lie Tyr 
565 570 575 

Gly Leu Leu Asn Met He Ala Ala Glu Gly Arg Glu Tyr Glu His He 
580 585 590 



<210> 44 
<211> 1779 
<212> DNA 
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<213> Neisseria meningitidis 
<400> 44 

atgacggttt tgaagctttc gcactggcgg 
caacacgtct cgcaactggc gcgtatggcg 
tggcagcaga tgccggcgca catacgcggg 
ctggtgcgcc cattggcggt tttcgatgcc 
ggttttcaga cggcattgaa gcacgagtgc 
gcgcggattg cgccggacaa ggcgcacaaa 
ggcagggggc ggcaggggcg gaagtggtcg 
tttggctggg tgtttgaccg gccgcagtat 

gtggcgtgtc ggcgcgcctt gtcgcgttta 
gatttggttg tcggacgcga caaattgggc 
ggcaaaacgg ttgccgtggt cggtatcggc 
aatgccgctt ccgtgcaatc gctgtttcag 
gccgtgctgc tggaaacgct gttggtggaa 
gacggatttg cgccttttgt ggcggaatat 
gtattgctgt tgcgcgacgg cgaaaccgtg 
caaggcgttt tgcacttgga aacggcagag 
agcctgcggt ccgacgacag gccggtttcc 
ctgctgttgg acggcggcaa cagccggctc 
gcaaccgtcg gtagcgcgcc gtaccgcgat 
aaggcggatg gaaatgtccg catcgtcggt 
caagtgcagg aacagctcgc ccgaaaaatc 
ggcatacgca accactaccg ccaccccgaa 
ttgggcagcc gccgcttcag ccgcaacgcc 
acggttgacg cgctcaccga tgacggacat 
cacctgatga aagaatcgct cgccgtccga 
cgttatcctt tcccgaccac aacgggcaat 
tgcggctcgg ttatgatgat gcacgggcgt 
gtcgatgtca tcattaccgg cggcggcgcg 
tttttggcgg aaaataccgt gcgcgtggcg 
atgattgocg ccgaaggcag ggaatatgaa 



gtgttggcgg agcttgccga cggtttgccg 60 

gatatgaagc cgcagcagct caacggtttt 120 

ctgttgcgcc aacacgacgg ctattggcgg 180 

gaaggtttgc gcgagctggg ggaaaggtcg 24 0 

gcgtccagca acgacgagat actggaattg 300 

accatatgcg tgacccacct gcaaagtaag 360 

caccgtttgg gcgagtgtct gatgttcagt 420 

gagttgggtt cgctgtcgcc tgttgcggca 480 

ggtttggatg tgcagattaa gtggcccaat 540 
ggcattctga ttgaaacggt caggacgggc 600 
atcaattttg tcctgcccaa ggaagtagaa 660 
acggcatcgc ggcggggcaa tgccgatgcc 720 
ctggacgcgg tgttgttgca atatgcgcgg 78 0 
caggctgcca accgcgacca cggcaaggcg 840 
ttcgaaggca cggttaaagg cgtggacgga 900 
ggcaaacaga cggtcgtcag cggcgaaatc 960 
gtgccgaagc ggcgggattc ggaacgtttt 1020 
aagtgggcgt gggtggaaaa cggcacgttc 108 0 
ttgtcgcctt tgggcgcgga gtgggcggaa 114 0 
tgcgctgtgt gcggagaatt caaaaaggca 1200 
gagtggctgc cgtcttccgc acaggctttg 1260 
gaacacggtt ccgaccgctg gttcaacgcc 1320 
tgcgtcgtcg tcagttgcgg cacggcggta 1380 
tatctcgggg gaaccatcat gcccggtttc 1440 
accgccaacc tcaaccggca cgccggtaag 1500 
gccgtcgcca gcggcatgat ggatgcggtt 1560 
ttgaaagaaa aaaccggggc gggcaagcct 1620 
gcaaaagttg ccgaagccct gccgcctgca 1680 
gacaacctcg tcatttacgg gttgttgaac 1740 
catatttaa 1779 



<210> 45 
<211> 262 
<212> PRT 

<213> Bacillus anthracis 



<400> 45 

Met lie Phe Val Leu Asp Val Gly Asn Thr Asn Ala Val Leu Gly Val 
15 10 15 

Phe Glu Glu Gly Glu Leu Arg Gin His Trp Arg Met Glu Thr Asp Arg 
20 25 30 

His Lys Thr Glu Asp Glu Tyr Gly Met Leu Val Lys Gin Leu Leu Glu 
35 40 45 

His Glu Gly Leu Ser Phe Glu Asp Val Lys Gly lie lie Val Ser Ser 
50 55 60 

Val Val Pro Pro lie Met Phe Ala Leu Glu Arg Met Cys Glu Lys Tyr 
65 70 75 80 

Phe Lys lie Lys Pro Leu Val Val Gly Pro Gly lie Lys Thr Gly Leu 
85 90 95 



Asn lie Lys Tyr Glu Asn Pro Arg Glu Val Gly Ala Asp Arg lie Val 
100 105 110 
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Asn Ala Val Ala 
115 

Asp Phe Gly Thr 
130 

Tyr Met Gly Gly 
145 

Leu Tyr Ser Arg 



Ser Ser Val Val 
180 

Leu Tyr Gly Tyr 
195 

Glu Glu Ala Lys 
210 

Lys Leu He Ser 
225 

Leu Thr Leu Lys 



Gly lie His Leu 
120 

Ala Thr Thr Tyr 
135 

Val lie Thr Pro 
150 

Ala Ala Lys Leu 
165 

Gly Lys Asn Thr 



Val Gly Gin Val 
200 

Gin Glu Pro Lys 
215 

Glu Glu Ser Asn 
230 

Gly Leu Tyr Met 

245 



Tyr Gly Ser Pro 



Cys Tyr He Asn 
140 

Gly lie Met lie 
155 

Pro Arg lie Glu 
170 

Val Ser Ala Met 
185 

Glu Gly He Val 



Val He Ala Thr 
220 

Val He Asp Val 
235 

Leu Tyr Glu Arg 
250 



Leu He He Val 
125 

Glu Glu Lys His 



Ser Ala Glu Ala 
160 

He Thr Lys Pro 
175 

Gin Ser Gly lie 
190 

Lys Arg Met Lys 
205 

Gly Gly Leu Ala 



Val Asp Pro Phe 
240 

Asn Ala Asn Leu 
255 



Gin His Glu Lys Gly Glu 
260 



<210> 46 
<211> 789 
<212> DNA 

<213> Bacillus anthracis 



<400> 46 

atgatttttg 

gaacttcgtc 

atgcttgtga 

atcgtatctt 

tttaaaatta 

gaaaatccac 

tat ggaagtc 

gaagaaaagc 

ttatatagta 

gggaagaata 

gaaggtattg 

ggtggattgg 

ttaacattaa 

ggtgaataa 



tattggatgt 
aacattggcg 
agcagttgct 
cagtcgtgcc 
agccgcttgt 
gtgaagtagg 
cgcttattat 
attatatggg 
gagccgcaaa 
cggtaagtgc 
ttaagcgcat 
cgaaattaat 
aaggtttgta 



agggaacaca 
catggaaaca 
tgagcatgag 
accaattatg 
agtaggtcct 
tgcggatcga 
tgtcgatttt 
tggagttatt 
acttcctcgt 
gatgcaatct 
gaaagaggaa 
ttcagaagaa 
tatgttatac 



aatgctgtac 
gatcgtcata 
ggtctttcgt 
tttgctttag 
ggaataaaaa 
atcgtaaatg 
ggtacggcta 
acaccgggaa 
attgaaatta 
ggtattcttt 
gctaaacaag 
t cgaatgtga 
gagcggaatg 



tgggcgtgtt 
agacagaaga 
ttgaagatgt 
agcgcatgtg 
cggggctaaa 
cagtagcagg 
ctacatattg 
ttatgatttc 
caaaaccaag 
atggttatgt 
aaccgaaagt 
ttgatgttgt 
caaatttaca 



tgaagagggg 
tgaatatgga 
gaaaggtatt 
tgaaaagtat 
tattaaatat 
gatccactta 
ttatattaac 
agcagaggct 
cagtgtagtt 
tggacaagtg 
tattgcaaca 
agatccattt 
gcatgagaaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

789 



<210> 47 
<211> 254 
<212> PRT 

<213> Bacillus halodurans 



<400> 47 

Met He Leu Val He Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
15 10 15 
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Tyr Gin Asp Glu Thr Leu Val His His Trp Arg Leu Ala Thr Ser Arg 
20 25 30 

Gin Lys Thr Glu Asp Glu Tyr Ala Met Thr Val Arg Ser Leu Phe Asp 
35 40 45 

His Ala Gly Leu Gin Phe Gin Asp lie Asp Gly lie Val lie Ser Ser 
50 55 60 

Val Val Pro Pro Met Met Phe Ser Leu Glu Gin Met Cys Lys Lys Tyr 
65 70 75 80 

Phe His Val Thr Pro Met lie lie Gly Pro Gly lie Lys Thr Gly Leu 
85 90 95 

Asn lie Lys Tyr Asp Asn Pro Lys Glu Val Gly Ala Asp Arg lie Val 
100 105 110 

Asn Ala Val Ala Ala He Glu Leu Tyr Gly Tyr Pro Ala lie Val Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Tyr Cys Leu He Asn Glu Lys Lys Gin 
130 135 140 

Tyr Ala Gly Gly Val lie Ala Pro Gly lie Met lie Ser Thr Glu Ala 
145 150 155 160 

Leu Tyr His Arg Ala Ser Lys Leu Pro Arg He Glu lie Ala Lys Pro 
165 170 175 

Lys Gin Val Val Gly Thr Asn Thr He Asp Ser Met Gin Ser Gly He 
180 155 190 

Phe Tyr Gly Tyr Val Ser Gin Val Asp Gly Val Val Lys Arg Met Lys 
195 200 205 

Ala Gin Ala Glu Ser Glu Pro Lys Val He Ala Thr Gly Gly Leu Ala 
210 215 220 

Lys Leu He Gly Thr Glu Ser Glu Thr He Asp Val He Asp Ser Phe 
225 230 235 240 

Leu Thr Leu Lys Gly Leu Gin Leu lie Tyr Lys Lys Asn Val 
245 250 



<210> 48 
<211> 765 
<212> DNA 

<213> Bacillus halodurans 

<400> 48 

atgatacttg tcattgatgt tggaaataca 
acgttagtgc atcattggcg gctagcgacg 
atgacggtgc gttctctctt tgatcatgca 
gtcatttcat ctgttgtccc accgatgatg 
tttcatgtca ctcctatgat tattgggcct 
gacaatccaa aagaggttgg ggccgatcga 
tatggctacc ctgccattgt cgttgatttt 
gaaaaaaaac aatatgcagg gggagtcatt 
ttgtatcatc gcgcatcaaa attgccacgg 



aatactgtgt taggggtcta ccaagatgaa 60 
agtaggcaaa agaccgagga tgagtatgca 120 
ggtctacagt ttcaagacat agacggcatt 180 
ttttccttag agcaaatgtg caaaaaatac 240 
ggaattaaga caggcttaaa tattaagtat 300 
atcgttaatg cagttgcagc gattgagtta 360 
ggaacagcaa caacatattg cttaattaat 420 
gctcctggaa tcatgatctc aacagaagcg 480 
attgaaatag cgaagccgaa acaagtcgta 540 
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gggacaaata cgattgattc gatgcaatca ggaatcttct acgggtatgt gagccaagtc 600 

gatggtgttg tgaaacgaat gaaggctcaa gcagaaagtg aaccgaaagt cattgcaact 660 

ggtgggcttg cgaagttaat cggaaccgag tcggaaacca ttgatgtaat cgattcgttt 720 

ttaacattaa aaggattgca actcatttat aagaagaatg tctga 7 65 

<210> 49 
<211> 258 
<212> PRT 

<213> Bacillus stearothermophilus 
<400> 49 

Met He Phe Val Leu Asp Val Gly Asn Thr Asn Thr Val Leu Gly Val 
15 10 15 

Tyr Asp Gly Asp Glu Leu Lys His His Trp Arg lie Glu Thr Ser Arg 
20 25 30 

Ser Lys Thr Glu Asp Glu Tyr Gly Met Met He Lys Ala Leu Leu Asn 
35 40 45 

His Val Gly Leu Gin Phe Ser Asp He Arg Gly He He He Ser Ser 
50 55 60 

Val Val Pro Pro lie Met Phe Ala Leu Glu Arg Met Cys Leu Lys Tyr 
65 70 75 80 

Phe His He Lys Pro Leu He Val Gly Pro Gly He Lys Thr Gly Leu 
85 90 95 

Asp He Lys Tyr Asp Asn Pro Arg Glu Val Gly Ala Asp Arg lie Val 
100 105 110 

Asn Ala Val Ala Gly He His Leu Tyr Gly Ser Pro Leu He He Val 
115 " 120 125 

Asp Phe Gly Thr Ala Thr Thr Tyr Cys Tyr lie Asn Glu His Lys Gin 
130 135 140 

Tyr Met Gly Gly Ala He Ala Pro Gly He Met He Ser Thr Glu Ala 
145 150 155 160 

Leu Phe Ala Arg Ala Ala Lys Leu Pro Arg He Glu He Ala Arg Pro 
165 170 175 

Asp Asp He He Gly Lys Asn Thr Val Ser Ala Met Gin Ala Gly He 
180 185 190 

Leu Tyr Gly Tyr Val Gly Gin Val Glu Gly He Val Ser Arg Met Lys 
195 200 205 

Ala Lys Ser Lys lie Pro Pro Lys Val He Ala Thr Gly Gly Leu Ala 
210 215 220 

Pro Leu lie Ala Ser Glu Ser Asp lie lie Asp Val Val Asp Pro Phe 
225 230 235 240 

Leu Thr Leu Thr Gly Leu Lys Leu Leu Tyr Glu Lys Asn Thr Glu Lys 
245 250 255 



Lys Gly 
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<210> 50 
<211> 777 
<212> DNA 

<213> Bacillus stearothermophilus 



<400> 50 

atgatttttg 

gaactgaaac 

atgatgatca 

atcatttcct 

ttccatatca 

gacaatccgc 

tacggcagtc 

gaacataaac 

ctgtttgcgc 

ggcaaaaata 

gaaggcatcg 

ggcggtttgg 

ttgacgctga 



tattggacgt 
atcattggcg 
aagcgctctt 
cggtcgtgcc 
aaccgctcat 

gtgaggtggg 
cgctgattat 
aatatatggg 
gggcggcgaa 

cggtcagcgc 
tgtcgcgaat 
ctccgctcat 
ctggcttaaa 



cggcaataca 
cattgaaaca 
gaaccatgtc 
gccgattatg 
cgtcggtccg 
cgccgaccgg 
cgtcgatttt 
aggggccatt 
attgccgcgc 
catgcaagcc 
gaaggcgaaa 
tgccagcgaa 
attgttgtac 



aacacggtgt 
agccgctcga 

ggcttgcagt 
tttgctcttg 
ggcattaaaa 
attgtcaacg 
ggcacggcga 
gccccgggaa 
attgaaatcg 
ggtattttat 
agcaaaatcc 
tcggacatca 
gagaaaaaca 



taggggtgta 
aaacggaaga 
tttccgacat 
aacgcatgtg 
ccgggctcga 
cggttgccgg 
cgacgtattg 
ttatgatctc 
cccgcccgga 
acggttatgt 
cgccgaaggt 
tcgatgtcgt 
ccgagaaaaa 



tgacggggac 
cgaatacggc 
tcgaggcatc 
tctaaaatat 
catcaaatat 
catccatttg 
ttatattaat 
gacagaggct 
tgatatcatc 
cggacaagtg 
gattgctact 
tgatccgttt 
aggatga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

777 



<210> 51 
<211> 260 
<212> PRT 

<213> Caulobacter crescentus 
<400> 51 

Met Leu Leu Ala lie Glu Gin Gly Asn Thr Asn Thr Met Phe Ala lie 
15 10 15 

His Asp Gly Ala Ser Trp Val Ala Gin Trp Arg Ser Ala Thr Glu Ser 
20 25 30 

Thr Arg Thr Ala Asp Glu Tyr Val Val Trp Leu Ser Gin Leu Leu Ser 
35 40 45 

Met Gin Gly Leu Gly Phe Arg Ala lie Asp Ala Val lie lie Ser Ser 
50 55 60 

Val Val Pro Gin Ser He Phe Asn Leu Arg Asn Leu Ser Arg Arg Tyr 
65 70 75 80 

Phe Asn Val Glu Pro Leu Val He Gly Glu Asn Ala Lys Leu Gly He 
85 90 95 

Asp Val Arg lie Glu Lys Pro Ser Glu Ala Gly Ala Asp Arg Leu Val 
100 105 110 

Asn Ala He Gly Ala Ala Met Val Tyr Pro Gly Pro Leu Val Val He 
115 120 125 

Asp Ser Gly Thr Ala Thr Thr Phe Asp He Val Ala Ala Asp Gly Ala 
130 135 140 

Phe Glu Gly Gly lie lie Ala Pro Gly lie Asn Leu Ser Met Gin Ala 
145 150 155 160 



Leu His Glu Ala Ala Ala Lys Leu Pro Arg He Ala He Gin Arg Pro 
165 170 175 
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Ala Gly Asn Arg lie Val Gly Thr 
180 

Gly Val Phe Trp Gly Tyr He Ser 
195 200 

lie Lys Ala Glu Arg Gly Glu Pro 
210 215 

Val Ala Ser Leu Phe Glu Gly Ala 
225 230 

Ser Asp Leu Thr He Arg Gly Leu 
245 

He Ala Glu Ser 
260 



Asp Thr Val Ser Ala Met Gin Ser 
185 190 

Leu He Glu Gly Leu Val Ala Arg 

205 

Met Thr Val He Ala Thr Gly Gly 
220 

Thr Asp Ser lie Asp His Phe Asp 
235 240 

Leu Glu He Tyr Arg Arg Asn Thr 
250 255 



<210> 52 
<211> 783 
<212> DNA 

<213> Caulobacter crescentus 
<400> 52 

atgctgctgg ccattgagca gggcaacacc aacaccatgt tcgccattca tgatggcgca 60 
tcgtgggtcg cgcagtggcg gtcagcgacc gaaagcacgc gcacggccga tgagtacgtc 120 
gtctggcttt cgcaactgct gtcgatgcag gggcttggct tccgggcgat cgacgccgtg 180 
atcatttcca gcgtcgtgcc gcagtcgatc ttcaatctgc gcaacctgag ccgccgctac 240 
ttcaacgtcg agcctctggt catcggtgag aacgccaagc tgggcattga tgtccgcatc 300 
gagaaaccct ccgaggccgg cgccgaccgc ctggtcaacg ccattggcgc ggcgatggtc 360 
tatccgggtc cgctggtcgt gatcgacagc ggcaccgcga cgacgttcga catcgtggcc 420 
gccgacggcg ccttcgaggg cgggattatc gcgcccggta tcaacctgtc gatgcaggct 480 
ctgcacgagg cggcggcgaa gctgccgcgc atcgccatcc agcgtcccgc cggtaacagg 540 
atcgtgggca cggacacggt ctccgccatg cagtccggcg tcttctgggg ctatatttcg 600 
ctgatcgaag gcctcgtcgc gcggatcaag gccgagcgcg gcgagcctat gaccgttatc 660 
gccacgggtg gcgtcgcctc gctgttcgag ggcgcgaccg acagcattga ccacttcgac 720 



tctgatctga cgatccgggg tcttctcgaa atctaccgcc gaaacaccat cgccgagtcc 
tga 

<210> 53 
<211> 257 
<212> PRT 

<213> Chlorobium tepidum 
<400> 53 

Met Arg Leu Val Val Asp He Gly Asn Thr Ser Thr Thr Leu Ala He 



1 



10 15 



Phe Thr Gly Asp Glu Glu Pro Ser Val Glu Ser Val Pro Ser Ala Leu 
20 25 30 

Phe Ala Asp Ser Ser Thr Met Arg Glu Val Phe Gly Asn Met Ala Arg 
35 40 45 

Lys His Gly Glu Pro Gin Ala He Ala He Cys Ser Val Val Pro Ser 
50 55 60 

Ala Thr Ala Val Gly Ser Ala Leu Leu Glu Ser Leu Phe Ser Val Pro 
65 70 75 80 



780 
783 
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Val Leu Thr He Cys Cys Lys Leu Arg Phe Pro Phe Arg Leu Asp Tyr 
85 90 95 

Ala Thr Pro His Thr Phe Gly Ala Asp Arg Leu Ala Leu Cys Ala Trp 
100 105 110 

Ser Arg His Leu Phe Ser Glu Lys Pro Val lie Ala Val Asp lie Gly 
115 120 125 

Thr Ala He Thr Phe Asp Val Leu Asp Thr Val Gly Asn Tyr Arg Gly 
130 135 140 

Gly Leu lie Met Pro Gly He Asp Met Met Ala Gly Ala Leu His Ser 
145 150 155 160 

Arg Thr Ala Gin Leu Pro Gin Val Arg He Asp Arg Pro Glu Ser Leu 
165 170 175 

Leu Gly Arg Ser Thr Thr Glu Cys He Lys Ser Gly Val Phe Trp Gly 
180 185 190 

Val Val Lys Gin He Gly Gly Leu Val Asp Ala He Arg Gly Asp Leu 
195 200 205 

Val Arg Asp Phe Gly Glu Ser Thr Val Glu Val lie Val Thr Gly Gly 
210 215 220 

Asn Ser Arg He He Val Pro Glu He Gly Pro Val Ser Val He Asp 
225 " 230 235 240 

Glu Leu Ala Val Leu Arg Gly Ser Asp Leu Leu Leu Arg Met Asn Met 
245 250 255 



Pro 



<210> 54 
<211> 774 
<212> DNA 

<213> Chlorobium tepidum 



<400> 54 

gtgcggctgg 

gaagagccgt 

gaagtgtttg 

gtggtgcctt 

gtgctgacca 

accttcggcg 

ccggttatcg 

aattatcgcg 

agaaccgccc 

acgaccgaat 

gtggacgcca 

gtcaccggcg 

gaactcgctg 



tcgttgacat 
cggtcgagtc 
gcaacatggc 
ccgctaccgc 
tctgctgtaa 
cggatcgcct 
ccgtcgatat 
gtggtctcat 
agcttcccca 
gcatcaaaag 
ttcgcggcga 
gcaatagcag 
tcctgcgcgg 



cggcaatacc 
ggtaccgagt 
ccggaagcac 
cgtcggttcg 
gctccgtttt 
tgccctgtgc 
cggcacggcc 
catgccgggt 
ggtgcgcatc 
cggagttttc 
ccttgtacgc 
gattatcgtt 
cagcgatctt 



agcacgacgt 
gcgttgtttg 
ggcgagccac 
gcgcttctcg 
ccttttcgtc 
gcatggagcc 
atcaccttcg 
atcgacatga 
gacaggccgg 
tggggagtgg 
gactttggcg 
ccggagatcg 
ttgctgcgga 



tggcgatttt 
ccgattccag 
aggccatcgc 
aatcactttt 
tcgactacgc 
gacatctctt 
acgtgctcga 
tggccggagc 
agagccttct 
tcaaacagat 
agtcaacggt 
gccctgtcag 
tgaatatgcc 



caccggtgat 
cacaatgcgc 
catttgcagc 
ctccgtaccg 
aaccccgcac 
ttctgaaaaa 
cacggtgggg 
gcttcattcg 
cgggcgctcg 
cggcggcctc 
cgaagtgatt 
tgttatcgac 
gtga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

774 



<210> 55 
<211> 256 
<212> PRT 

<213> Clostridium difficile 
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<400> 55 

Met Leu Leu Val Phe Asp Val Gly Asn Thr Asn Met Val Leu Gly He 
15 10 15 

Tyr Lys Gly Asp Lys Leu Val Asn Tyr Trp Arg lie Lys Thr Asp Arg 
20 25 30 

Glu Lys Thr Ser Asp Glu Tyr Gly He Leu He Ser Asn Leu Phe Asp 
35 40 45 

Tyr Asp Asn Val Asn He Ser Asp He Asp Asp Val He He Ser Ser 
50 55 60 

Val Val Pro Asn Val Met His Ser Leu Glu Asn Phe Cys He Lys Tyr 
65 70 75 80 

Cys Lys Lys Gin Pro Leu He Val Gly Pro Gly He Lys Thr Gly Leu 
85 90 95 

Asn He Lys Tyr Asp Asn Pro Lys Gin Val Gly Ala Asp Arg He Val 
100 105 110 

Asn Ala Val Ala Gly lie Glu Lys Tyr Gly Ala Pro Ser lie Leu Val 
115 120 125 

Asp Phe Gly Thr Ala Thr Thr Phe Cys Ala He Ser Glu Lys Gly Glu 
130 135 140 

Tyr Leu Gly Gly Thr He Ala Pro Gly He Lys He Ser Ser Glu Ala 
145 150 155 160 

Leu Phe Gin Ser Ala Ser Lys Leu Pro Arg Val Glu Leu Ala Lys Pro 
165 170 175 

Gly Met Thr He Cys Lys Ser Thr Val Ser Ala Met Gin Ser Gly He 
180 185 190 

He Tyr Gly Tyr Val Gly Leu Val Asp Lys He He Ser He Met Lys 
195 200 205 

Lys Glu Leu Asn Cys Asp Asp Val Lys Val He Ala Thr Gly Gly Leu 
210 215 220 

Ala Lys Leu He Ala Ser Glu Thr Lys Ser He Asp Tyr Val Asp Gly 
225 230 235 240 

Phe Leu Thr Leu Glu Gly Leu Arg He He Tyr Glu Lys Asn Gin Glu 
245 250 255 



<210> 56 
<2H> 771 
<212> DNA 

<213> Clostridium difficile 
<400> 56 

atgcttctag tatttgatgt tggaaatact aatatggttt taggtatata taaaggtgac 
aaattagtta attactggag aattaaaaca gatagggaaa aaacgtctga tgaatatgga 120 
atcctgataa gtaacctatt tgattatgat aatgtgaata taagtgatat tgatgatgtt 180 
ataatatcat ctgtagttcc gaatgttatg cattctcttg aaaacttttg tataaagtac 240 



60 
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tgtaaaaaac agccattaat agtaggtcca ggcataaaaa caggtctaaa tataaaatat 300 
gataatccaa aacaagttgg ggcagataga atagttaatg ctgtagcagg gatagaaaag 360 
tatggagcac caagtatact tgttgatttt ggaacagcaa ctacattttg tgctatctct 420 
gaaaaaggtg aatatttggg tggaacaata gcaccaggaa taaaaatatc tagtgaggcg 480 
ttatttcaaa gtgcgtctaa attacctaga gtagaattag ctaagccagg tatgactatt 540 
tgtaagagta ctgtatcagc catgcaatct ggaataattt atggatatgt tggtttagtt 600 
gacaaaataa taagtattat gaagaaagaa ttgaattgtg atgatgttaa ggttatagct 660 
acaggtggat tagctaaact gattgcttca gagacgaaaa gtatagatta tgtagatggt 720 
tttttaacac tagaaggatt gagaataata tatgaaaaaa accaagaata a 771 

<210> 57 
<211> 219 

<212> PRT 

<213> Dehalococcoid.es ethenogenes 
<400> 57 

Met Ser Glu Lys Leu Val Ala Val Asp lie Gly Asn Thr Ser Val Asn 
15 10 15 

He Gly He Phe Glu Gly Glu Lys Leu Leu Ala Asn Trp His Leu Gly 
20 25 30 

Ser Val Ala Gin Arg Met Ala Asp Glu Tyr Ala Ser Leu Leu Leu Gly 
35 40 45 

Leu Leu Gin His Ala Gly Tie His Pro Glu Glu Leu Asn Arg Val lie 
50 55 60 

Met Cys Ser Val Val Pro Pro Leu Thr Thr Thr Phe Glu Glu Val Phe 
65 70 75 80 

Lys Ser Tvr Phe Lys Ala Ala Pro Leu Val Val Gly Ala Gly He Lys 
85 90 95 

Ser Gly Val Lys Val* Arg Met Asp Asn Pro Arg Glu Val Gly Ala Asp 
100 105 110 

Arg He Val Asn Ala Ala Ala Ala Arg Val Leu Tyr Pro Gly Ala Cys 
115 120 125 

He He Val Asp Met Gly Thr Ala Thr Thr Phe Asp Thr Leu Ser Glu 
130 135 140 

Gly Gly Ala Tyr He Gly Gly Ala He Ala Pro Gly He Ala Thr Ser 
145 150 155 160 

Ala Gin Ala He Ala Glu Lys Thr Ser Lys Leu Pro Lys He Glu He 
165 170 175 

He Arg Pro Ala Lys Val He Gly Ser Asn Thr Val Ser Ala Met Gin 
180 185 190 

Ser Gly lie Tyr Phe Gly Tyr He Gly Leu Val Glu Glu Leu Val Arg 
195 200 205 

Arg He Gin Thr Glu Leu Gly Gin Lys Thr Arg 
210 215 



<210> 58 
<211> 659 
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<212> DNA 

<213> Dehalococcoides ethenogenes 
<400> 58 

atgtctgaaa aactggtggc ggtagatatc ggcaatacca gcgtaaatat aggtatattt 60 

gagggcgaaa aactgctggc aaactggcat ctgggttcgg ttgcccagcg tatggctgat 120 

gaatatgcca gtctgctctt aggcctgttg cagcacgccg gtatacaccc ggaagagcta 180 

aacagggtaa tcatgtgcag tgttgtgccg cccctgacca ctacttttga agaggtattt 240 

aaaagctatt tcaaggctgc tcctctggta gtgggtgcag gtataaagag cggggttaag 300 

gtgcgcatgg ataacccccg tgaggttggg gctgaccgca tagtaaatgc cgctgccgcc 360 

agggtgcttt atccgggggc gtgcataata gtggacatgg gtacggccac tacctttgat 420 

accctttccg agggtggggc atatataggc ggggcgattg cacccggtat tgccacctca 480 

gcccaggcta ttgcggaaaa gacttcaaaa ctgcccaaga ttgagataat ccgtcctgcc 540 

aaagttatcg gctctaatac tgtgtcggct atgcagtcag gtatatactt cggttatatc 600 

gggctggtgg aagagctggt caggcggatt caaactgaat tggggcagaa aaccagagt 659 

<210> 59 
<211> 212 
<212> PRT 

<213> Desulf ovibrio vulgaris 
<400> 59 

Met Thr Gin His Phe Leu Leu Phe Asp lie Gly Asn Thr Asn Val Lys 
15 10 15 

He Gly He Ala Val Glu Thr Ala Val Leu Thr Ser Tyr Val Leu Pro 
20 25 30 

Thr Asp Pro Gly Gin Thr Thr Asp Ser He Gly Leu Arg Leu Leu Glu 
35 40 45 

Val Leu Arg His Ala Gly Leu Gly Pro Ala Asp Val Gly Ala Cys Val 
50 55 60 

Ala Ser Ser Val Val Pro Gly Val Asn Pro Leu Tie Arg Arg Ala Cys 
65 70 75 80 

Glu Arg Tyr Leu Tyr Arg Lys Leu Leu Phe Ala Pro Gly Asp He Ala 
85 90 95 

He Pro Leu Asp Asn Arg Tyr Glu Arg Pro Ala Glu Val Gly Ala Asp 
100 105 110 

Arg Leu Val Ala Ala Tyr Ala Ala Arg Arg Leu Tyr Pro Gly Pro Arg 
115 120 125 

Ser Leu Val Ser Val Asp Phe Gly Thr Ala Thr Thr Phe Asp Cys Val 
130 135 140 

Glu Gly Gly Ala Tyr Leu Gly Gly Leu lie Cys Pro Gly Val Leu Ser 
145 150 155 160 

Ser Ala Gly Ala Leu Ser Ser Arg Thr Ala Lys Leu Pro Arg He Ser 
165 170 175 

Leu Glu Val Glu Glu Asp Ser Pro Val He Gly Arg Ser Thr Thr Thr 
180 185 190 



Ser Leu Asn His Gly Phe He Phe Gly Phe Ala Ala Met Thr Glu Gly 
195 200 205 
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Val Leu Ala Ala 
210 



<210> 60 
<211> 639 
<212> DNA 

<213> Desulf ovibrio vulgaris 
<400> 60 

atgacccagc atttcctgct gttcgacatc ggcaacacca acgtcaagat cggcatcgcg 60 

gtggaaaccg ccgtgctgac ttcgtacgtg ctgcccacag accccggcca gacgaccgac 120 

tccatcgggc tgcgcctgct ggaggtgctg cgccatgccg ggctgggacc ggcggacgtg 180 

ggggcctgcg tggccagttc ggtggtgccc ggcgtcaacc cgctgatccg ccgcgcctgc 240 

gaacgttacc tgtatcgcaa gctgctgttc gcccccggcg acatcgccat tccgctggac 300 

aaccgctacg aacggcccgc cgaagtgggc gcggaccggc tggtggcggc ctatgccgcc 360 

cggcggctgt accccggccc ccggtcgctg gtatccgtgg atttcggcac cgccaccacg 420 

tttgactgcg tggaaggggg tgcgtatctt ggtggtttga tctgtcccgg cgtgctgtcg 480 

tccgccgggg cgttgtcgtc gcgcacggcc aagctgccgc gcatcagtct ggaagtggaa 540 

gaggattcgc cggtcatcgg gcggtccacc accaccagcc tgaaccacgg cttcattttc 600 

ggctttgccg ccatgaccga aggggtgctg gccgcctga 639 

<210> 61 
<211> 249 
<212> PRT 

<213> Pseudomonas putida 
<400> 61 

Met He Leu Glu Leu Asp Cys Gly Asn Ser Phe He Lys Trp Arg Val 
15 10 15 

lie His Val Ala Asp Ala Val lie Glu Gly Gly Gly lie Val Asp Ser 
20 25 30 

Asp Gin Ala Leu Val Ala Glu Val Ala Ala Leu Ala Ser Val Arg Leu 
35 40 45 

Thr Gly Cys Arg He Val Ser Val Arg Ser Glu Glu Glu Thr Asp Ala 
50 55 60 

Leu Cys Ala Leu He Ala Gin Ala Phe Ala Val Gin Ala Lys Val Ala 
65 70 75 80 

His Pro Val Arg Glu Met Ala Gly Val Arg Asn Gly Tyr Asp Asp Tyr 
85 90 95 

Gin Arg Leu Gly Met Asp Arg Trp Leu Ala Ala Leu Gly Ala Phe His 
100 105 HO 

Leu Ala Lys Gly Ala Cys Leu Val He Asp Leu Gly Thr Ala Ala Lys 
115 120 125 

Ala Asp Phe Val Ser Ala Asp Gly Glu His Leu Gly Gly Tyr He Cys 
130 135 140 

Pro Gly Met Pro Leu Met Arg Ser Gin Leu Arg Thr His Thr Arg Arg 
145 150 155 160 

He Arg Tyr Asp Asp Ala Ser Ala Glu Arg Ala Leu Ser Ser Leu Ser 
165 170 175 

Pro Gly Arg Ser Thr Val Glu Ala Val Glu Arg Gly Cys Val Leu Met 
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180 

Leu Gin Gly Phe Ala 
195 

Gly Glu Glu Phe Thr 
210 

Arg Ala Ala Leu Pro 
225 

Gly Leu Ala Met Ala 
245 



185 

Tyr Thr Gin Leu Glu 
200 

Val Phe Leu Thr Gly 
215 

Gin Ala Arg Val Val 
230 

Cys Pro Leu Asp 



190 

Gin Ala Arg Val Leu Trp 
205 

Gly Asp Ala Pro Leu Val 
220 

Pro Asp Leu Val Phe Val 
235 240 



<210> 62 
<211> 750 
<212> DNA 

<213> Pseudomonas putida 



<400> 62 

atgattcttg 

gatgctgtga 

gctgcgctcg 

gagaccgatg 

caccctgtcc 

atggatcgtt 

attgacctgg 

ggctacatct 

atccgctatg 

actgtcgaag 

cttgagcagg 

gcgccactgg 

ggcctggcaa 



agctcgattg 
ttgaaggtgg 
cttcagtgcg 
cgctttgcgc 
gtgaaatggc 
ggctggcggc 
gtaccgcggc 
gcccaggtat 
acgatgcctc 
cggtagagcg 
ctcgtgtgct 
tgagggcggc 
tggcttgtcc 



cggtaacagc 
tgggatcgtc 
tctcacgggt 
gttgattgct 
aggtgtgcgc 
gttgggggca 
aaaagcggac 
gccattgatg 
cgcggagcgc 
cggttgcgta 
atggggtgag 
cctgccacag 
attggattga 



ttcatcaagt 
gattccgatc 
tgccgtattg 
caggcatttg 
aatggctatg 
tttcacctgg 
ttcgtttctg 
cgtagccagc 
gcattgagca 
ttgatgctcc 
gagttcaccg 
gcgcgggtcg 



<210> 63 
<211> 241 
<212> PRT 

<213> Thiobacillus f errooxidans 



ggcgggtgat 
aggcgctggt 
tcagtgtgcg 
ccgtgcaggc 
acgactatca 
ccaagggcgc 
cagatggcga 
tgcgcactca 
gcttgtcacc 
agggctttgc 
tgttcctcac 
tgcctgacct 



ccatgttgcc 
ggcggaagtg 
cagcgaagaa 
gaaggttgcc 
gcgcctgggt 
gtgcctggtg 
gcatcttggg 
cacccgtcgg 
aggtcgctcg 
ctacacccag 
tggcggtgat 
ggttttcgtt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

750 



<400> 63 

Met lie Phe lie Ala Val Gly Asn Thr Arg Thr Leu Leu Ala His Thr 
15 10 15 

His Asp Gly Val His Phe Asp Ser Val Ser Val Ala Thr Ser Leu Pro 
20 25 30 

Pro Thr Glu He Leu Gin Gin Pro Gly Leu Thr Trp Leu Ser Ala Pro 
35 40 45 

Asn Arg Glu Pro Val Ala Leu Gly Gly Val Val Pro Ala Ala Leu Ala 
50 55 60 

Ala Trp Arg Glu Ala Leu Ala Thr Ala Glu Val Arg Glu Pro Asp Pro 
65 70 75 80 

Gly Phe Phe Arg Arg Ala Val Pro His Asp Tyr His Pro Pro Glu Ser 
85 90 95 



Leu Gly Phe Asp Arg Arg Cys Cys Leu Leu Ala Ala Ala Met Asp Tyr 
100 105 HO 
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Pro Gly Gin Asp Ser He Val He Asp Met Gly Thr Ala He Thr He 
115 120 125 

Asp Leu Leu Ala Gly Gly His Phe Arg Gly Gly Arg lie Leu Pro Gly 
130 135 140 

He Ala Met Ser Leu Arg Gly Leu His Glu Gly Thr Ala Leu Leu Pro 
145 150 155 160 

Glu Val Val Leu Asn Ala Pro Ala Glu Met Leu Gly Asn Asp Thr Ser 
165 170 175 

Asn Ala lie Gin Ala Gly Val lie His Leu Phe Ala Asp Ala Leu Arg 
180 185 190 

Gly Ala He Thr Asp Phe Arg Gin Tyr Ser Pro Gin Ala Arg He Leu 
195 200 205 

He Thr Gly Gly Asp Ala Glu Arg Trp Gin Pro Gly He Ala Gly Ser 
210 215 220 

Leu Tyr Gin Pro His Leu Leu Leu Arg Gly Phe Tyr Leu Trp lie Arg 
225 230 235 240 

Gly 



<210> 64 
<211> 726 
<212> DNA 

<213> Thiobacillus ferrooxidans 



<400> 64 

atgatcttca 

catttcgaca 

ggcttgacat 

gcggcgcttg 

ggcttttttc 

cgccgttgct 

gacatgggca 

attctgccgg 

gaagtcgtcc 

gccggggtca 

tacagccccc 

atcgctggta 

ggatga 



tcgccgtcgg 
gcgtcagcgt 
ggctcagcgc 
ccgcctggcg 
gccgcgccgt 
gcctgctcgc 
ccgccatcac 
gtatcgccat 
tgaacgcccc 
tccacctctt 
aggcacggat 

gcctgtacca 



caatacccgc 
ggccacttcg 
gccgaaccgg 
ggaagccttg 
gccgcacgac 
cgccgccatg 
catcgacctg 
gagcctgcgc 
agcggaaatg 
tgccgatgcc 
actgatcacc 
gccccatctg 



accctgctgg 
ctgccaccca 
gaacccgtcg 
gccacggcag 
tatcatccgc 
gactaccccg 

ctggctggcg 
ggtctgcatg 
ctgggcaatg 
ctgcgcggcg 
ggtggcgatg 
cttctgcgcg 



cacacaccca 
cggaaatcct 
cgctgggcgg 
aggtccgcga 
cggaaagcct 
gccaggacag 
gacatttccg 
aaggcacggc 
acaccagcaa 
ccattaccga 
ccgaacgttg 
gcttttatct 



cgatggcgtg 
gcagcagccc 
cgtcgtacct 
acccgacccc 
cggctttgac 
c atcgtcatc 
gggcggacgc 
actccttcct 
cgccattcag 
ctttcgccag 
gcaacccggc 
gtggatacgg 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

726 



<210> 65 
<211> 242 
<212> PRT 

<213> Xylessa fastidiosa 



<400> 65 

Met Asn Asp Trp Leu Phe Asp Leu Gly Asn Ser Arg Phe Lys Cys Ala 
15 10 15 



Ser Leu Arg Glu Gly Val He Gly Pro Val Thr Val Leu Pro Tyr Leu 
20 25 30 
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Thr Glu Thr Met Asp Ala Phe Ala Leu Gin Glu Leu Pro Arg Gly Arg 
35 40 45 

Val Ala Tyr Leu Ala Ser Val Ala Ala Pro Ala lie Thr Thr His Val 
50 55 60 

Leu Glu Val Leu Lys lie His Phe Glu Gin Val Gin Val Ala Ala Thr 
65 70 75 80 

Val Ala Ala Cys Ala Gly Val Arg He Ala Tyr Ala His Pro Glu Arg 
85 90 95 

Phe Gly Val Asp Arg Phe Leu Ala Leu Leu Gly Ser Tyr Gly Glu Gly 
100 105 110 

Asn Val Leu Val Val Gly Val Gly Thr Ala Leu Thr He Asp Leu Leu 
115 120 125 

Ala Ala Asn Gly Cys His Leu Gly Gly Arg He Ser Ala Ser Pro Thr 
130 135 140 

Leu Met Arg Gin Ala Leu His Ala Arg Ala Glu Gin Leu Pro Leu Ser 
145 150 155 160 

Gly Gly Asn Tyr Leu Glu Phe Ala Glu Asp Thr Glu Asp Ala Leu Val 
165 170 175 

Ser Gly Cys Asn Gly Ala Ala Val Ala Leu He Glu Arg Ser Leu Tyr 
180 185 190 

Glu Ala His Gin Arg Leu Asp Gin Ser Val Arg Leu Leu Leu His Gly 
195 200 205 

Gly Gly Val Ala Ser Leu Leu Pro Trp Leu Gly Asp Val Val His Arg 
210 215 220 

Pro Thr Leu Val Leu Asp Gly Leu Ala He Trp Ala Ala Val Ala Ala 
225 230 235 240 

Asn Val 



<210> 66 
<211> 729 
<212> DNA 

<213> Xylella fastidiosa 



<400> 66 

atgaatgatt 

ggtgtgattg 

ttacaggagc 

actacacatg 

gtcgctgcat 

aggttcttag 

acagcattga 

gcttcaccga 

ggtgggaact 

ggtgcagcgg 

tcggttcgat 

gtggtacatc 



ggttattcga 
gtcctgtaac 
taccacgtgg 
tgctcgaagt 
gtgccggagt 
cgttgcttgg 
ctattgattt 
cattgatgcg 
acttggagtt 
tggcattgat 
tattgttgca 
gtcctacatt 



tctaggtaat 
ggttttgccg 
tcgtgtggct 
attaaaaatc 
acgaattgcc 
ttcgtatggt 
gttggctgcc 
ccaagcgttg 
tgcggaagat 
cgaacgtagc 
tggtggaggt 
agtattggat 



tcgcgtttta 
tacttaacag 
tacttggcga 
cacttcgagc 
tatgctcacc 
gagggcaatg 
aatggttgtc 
catgcacgcg 
acagaggatg 
ctgtatgagg 
gtagcatctt 
ggcctggcga 



aatgtgcatc 
agaccatgga 
gtgtcgctgc 
aagtccaggt 
cggaacgttt 
tcctggtagt 
atctcggagg 
cggagcaact 
cgttggtgtc 
cacatcaacg 
tattgccttg 
tctgggctgc 



gctcagggaa 
cgcgtttgcg 
tccggctatt 
ggctgcaacc 
tggagtggat 
gggtgtcggg 
gcgtatcagt 
ccccctcagt 
agggtgcaat 
tttggaccag 
gttgggcgac 
cgttgcagct 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 
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7 ? Q 

aacgtttag 

<210> 67 
<211> 223 
<212> PRT 

<213> Helicobacter pylori 
<400> 67 

Met Pro Ala Arg Gin Ser Phe Thr Asp Leu Lys Asn Leu Val Leu Cys 
15 10 15 

Asp He Gly Asn Thr Arg He His Phe Ala Gin Asn Tyr Gin Leu Phe 
20 25 30 

Ser Ser Ala Lys Glu Asp Leu Lys Arg Leu Gly lie Gin Lys Glu He 
35 40 45 

Phe Tyr He Ser Val Asn Glu Glu Asn Glu Lys Ala Leu Leu Asn Cys 
50 55 60 

Tyr Pro Asn Ala Lys Asn He Ala Gly Phe Phe His Leu Glu Thr Asp 
65 70 75 80 

Tyr Val Gly Leu Gly lie Asp Arg Gin Met Ala Cys Leu Ala Val Asn 
85 90 95 

Asn Gly Val Val Val Asp Ala Gly Ser Ala He Thr He Asp Leu He 
100 105 HO 

Lys Glu Gly Lys His Leu Gly Gly Cys He Leu Pro Gly Leu Ala Gin 
115 120 125 

Tyr lie His Ala Tyr Lys Lys Ser Ala Lys He Leu Glu Gin Pro Phe 
130 135 140 

Lys Ala Leu Asp Ser Leu Glu Val Leu Pro Lys Ser Thr Arg Asp Ala 
145 150 155 160 

Val Asn Tyr Gly Met Val Leu Ser Val He Ala Cys He Gin His Leu 
165 170 175 

Ala Lys Asn Gin Lys He Tyr Leu Cys Gly Gly Asp Ala Lys Tyr Leu 
180 185 190 

Ser Ala Phe Leu Pro His Ser Val Cys Lys Glu Arg Leu Val Phe Asp 
195 200 205 

Gly Met Glu He Ala Leu Lys Lys Ala Gly He Leu Glu Cys Lys 
210 215 220 



<210> 68 
<211> 672 
<212> DNA 

<213> Helicobacter pylori 
<400> 68 

atgccagcta ggcaatcttt tacagatttg 
acgcgtatcc attttgcaca aaactatcag 
cgtttgggta ttcaaaagga aattttttac 
cttttgaatt gttaccctaa cgctaaaaat 



aaaaacctgg ttttgtgcga tataggcaac 60 
ctcttttcaa gcgctaaaga agatttaaag 120 
attagcgtga atgaagaaaa tgaaaaagcc 180 
attgcagggt tttttcattt agaaaccgac 240 
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tatgtagggc 
gtggatgccg 
tgtattttac 
gagcaacctt 
gtgaattacg 
aaaatctatc 
tgcaaggagc 
gaatgcaaat 



ttgggataga 
ggagtgcgat 
ccggtttagc 
tcaaggcctt 
gcatggtttt 
tttgtggggg 
gtttggtttt 
ga 



ccggcaaatg 
tacgatagat 
ccaatatatt 
agattcttta 
gagcgtcatt 
cgatgcgaag 
tgacgggatg 



gcgtgtctgg 
ttaatcaaag 
catgcgtata 
gaagttttac 
gcttgtatcc 
tatttgagcg 
gaaatcgctc 



cggtaaataa 
agggcaagca 
aaaaaagcgc 
ctaaaagcac 
agcatttagc 
cgtttttacc 
ttaaaaaagc 



tggcgtggtg 300 
tttaggaggg 360 
taaaatttta 420 
tagagacgct 480 
caaaaatcaa 540 
ccattctgtt 600 
agggatacta 660 
672 



<210> 69 
<211> 8320 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid, pAN296 



<400> 69 

tgcgccgcta 

tcggtgcggg 

ttaagttggg 

ttgtaatacg 

gatgaccgct 

tttaacgatg 

gtcatattgg 

catatagcgg 

gcatacttat 

ttgatagaaa 

ctggatgtcg 

gtactgatta 

gacatgaaag 

cttccgcaaa 

aaaatcgcca 

tcaaacgctt 

ggcccgattc 

gtcggccgct 

acgaaaaagc 

ctgtttccag 

aactgcctgt 

gattgttgaa 

cagaaactga 

ttaccccgtg 

tttgcacctt 

tctccgcttt 

tttcccggaa 

gcgaggggcg 

atgcttcagg 

tcccgctatt 

ggcgcggcac 

aaggatctcc 

gcggtgaacg 

gaacattctc 

ctt gactgta 

ggagtcggcg 

gttcatgtgt 

ggacttcacg 

ggacttgccg 

ggctgttata 

gaagataaat 

tcgacagaag 

gtctggggaa 



cagggcgcgt 

cctcttcgct 

taacgccagg 

actcactata 

tttcaggtct 

tcagaaacgg 

gaaggaaata 

gaaaaaggat 

gcgcggctga 

tcaaaaggaa 

gccgcctaga 

acaatgcagg 

cgatgtttga 

tgcttgagca 

caccgaagtc 

tgcggatgga 

agacggactt 

ggatgcttga 

gggagatcaa 

ctcttgtaga 

ggtggagtgg 

tgtcagttca 

ttgatgcaga 

cagatgagtg 

acattgccga 

ttgaggtgca 

gatggctttt 

ggatttatga 

aaaccgatga 

cgattgctgt 

tcgggtttcg 

tccgccaaaa 

aagggagacg 

gtcagctgtt 

tgaatataaa 

gaggaccggg 

ttttcgcaga 

agaagatctc 

tagggaggcc 

cggtagagga 

atttagagcc 

agggaaggcg 

cgggaggaag 



ccattcgcca 

attacgccag 

gttttcccag 

gggcgaattg 

gtcagcagct 

caggtcacct 

ccgtatactt 

ttggataacc 

aggagcccat 

aataaccgag 

ggatatcgcc 

cttcggtata 

tgtgaatgtc 

aaaaaaggga 

tagcctgtat 

gctttcggga 

tttttccatt 

tcctgatgac 

tcttccgcgt 

aaagctggca 

cttgtttctc 

ttcagaagtc 

gcctgtattc 

ggcaaccgag 

ggtgtttcct 

gcatatgaag 

aaagtgcgac 

agtgttaaag 

ttaccgcatc 

cggttcgaca 

cgtgacagtg 

gggagtcact 

gcaggcggaa 

cttaggatat 

gccaagtctt 

cggtgtagca 

accaactcat 

cgtccaggat 

gtcaggattt 

caatacgctt 

ctctgctctt 

ctatgctcag 

catggttcca 



ttcaggctgc 

ctggcgaaag 

tcacgacgtt 

ggcccgacgt 

ttttcctgct 

tgatgtgttg 

taaaacaatg 

ggcgcttcag 

gtcctgctgt 

gaatggagcg 

cgggtccgcg 

tttgaaacgg 

ttcggcctga 

catatcatca 

tccgcgacca 

accggcattt 

gctgataaag 

gtggcagctc 

ttaatgaatg 

ggacgcgcgc 

acggggcagt 

cttcatgctc 

tgggcaaatt 

atagcggaag 

gagacgaaag 

ggaaagctgg 

catgagcttc 

tatgctgaaa 

ttacaggaag 

ggaaatctag 

catatgtccg 

gttatggagt 

caagatccat 

gctgttgctg 

gagacgccct 

tttgggctga 

tcaccttgta 

atcggcctgg 

gtcggcaagc 

tatactttgc 

gctggcatgt 

aaatataaga 

aaagatgaaa 



gcaactgttg 

ggggatgtgc 

gtaaaacgac 

cgcatgctgg 

gtatatgaaa 

gaggctgaag 

aagaggtgaa 

gagggcttgg 

cggctagacg 

gacagtgtga 

atcagatcgg 

ttttagactc 

tcgcctgtac 

atatcgcctc 

aacatgccgt 

atgtgacaac 

gcggggacta 

aaattacagc 

ccggcactaa 

tcatgaaaaa 

ttttgatagt 

tgcttaaaga 

caggcaagaa 

cggaaaaaag 

gcgctaaagg 

aagcggcata 

cgatttcagg 

atctcgcgct 

agcggtttac 

gtttaagcat 

ccgatgctaa 

acgaaacaga 

tctgttattt 

caagccgatt 

tgtttgtgta 

agcttttata 

tgctgttagg 

ataatcagac 

tgattgaacc 

ttcatatgct 

tcgggccggt 

tggaacatgc 

tggctgcgta 



ggaagggcga 

tgcaaggcga 

ggccagtgaa 

atgaaaagcc 

aggaattgtt 

aagccatcac 

atgggtgaaa 

agaaagaatc 

cgaggatcgt 

gatttttcct 

ctcgattgat 

tacattggat 

aaaagcggtg 

tcaagcgggg 

gttaggttac 

agtcaacccg 

cgccaaaaat 

tgcaattttt 

gctgtatcag 

ataatgatag 

ggaagggaga 

tccgtttatt 

agaggggcca 

aatgcagcgg 

aatcatcgag 

tcagcagcca 

atcgattaaa 

tcaagaagga 

cgggtttttc 

cggcatcatc 

gcagtggaaa 

ttacagtgaa 

tattgatgat 

aaaaacacag 

tctgccgtgc 

cggagatgat 

gctttattca 

ggctgctgac 

gcttctgagc 

ggctgtatct 

tcagcttttt 

cgtacatgtc 

taaccgaatc 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 
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ggtgctgatt tgctaaaaaa acgaaatgga 
tacaatcttt gtaagggttt taacctacag 
aacttaacct acatacttta tatacacagc 
tagttaccct tattatcaag ataagaaaga 
aaaaaaacac aaaagaccac attttttaat 
ttagttcaac aaacgaaaat tggataaagt 
cagtaatatt gacttttaaa aaaggattga 
cctaaattca ctttagataa aaatttagga 
tttagacaat tggaagagaa aagagatatt 
tagtataacc acagaaattg atattagtgt 
taaattttac cctgcattta ttttcttagt 
tagaactggt tacaatagcg acggagagtt 
tacaattttt gatggtgtat ctaaaacatt 
cttcaaagag ttttatgatt tatacctttc 
attgtttccc aaaacaccta tacctgaaaa 
ttcatttact gggtttaact taaatatcaa 
tacagcagga aaattcatta ataaaggtaa 
acatcattct gtttgtgatg gttatcatgc 
gtcagatagg cctaatgact ggcttttata 
agtcggtttt ctaatgtcac taacctgccc 
ctgtcgacta aggtcgagga agtgttggta 
ctgtatgtct ctgatttgga ggcgtctagg 
ggttataaag agtatcaaaa atggagctca 
ctagtgattg tgcaggcgaa agagccattt 
ggtctgaacc atctcgcatt tcatgctgaa 
aaattgacgg caaaaggcta tcgtgtgttg 
gacgggcatt atgcagtctt ttgtgaggat 

ccaagctgtt aatcgtgatc ttcggacagg 
tgcgcttttc ggtttttcgc atacttgaag 
aatataataa atacaaacag ctgaaacatc 
tgtttgcggg agagattcat tctcttccgt 
aacggtgttt tgtggtctcc attttcattt 
cgagcagggt aatgccgctc aggacagaaa 
ggtctataca tatccccccg acccaagggc 
ttgccccgaa ataagtgcct tttaatcctg 
tcatagagaa taaaagcact tcgccgattg 
gaacaccgtg tgatacggtg aaaatggcca 
tcagagaaca aagcggcgaa aaccgttttg 
acacaacgat tgcgtttaat gtcagcatca 
aggggttctg cgccatatac tgagggaatg 
atagcgtaat gccgaccaaa gcaatggtaa 
cttcccgcac gctcatattt cgggactggg 
attggagggc aagcacaatt ccgtatagtc 
tagtcgattg cgatgagccg aaatataggc 
taatagccgc atagcgtaaa ttaaaaacta 
acaacaaggc ctttgaagcg ggctcaaaca 
ctacaaaaaa cacccagaga ttagatgctg 
cgaaaatcga tacaagcatc atgttttttc 
aaaagcttgc gaggatgccg actgatgagc 
ctgatgcgcc ttggacggct gtcaaataaa 
ccattctgcc gaaaatggtt ccgattataa 
ttctatagtg tcacctaaat agcttggcgt 
attgttatcc gctcacaatt ccacacaaca 
ggggtgccta atgagtgagc taactcacat 
agtcgggaaa cctgtcgtgc cagctgcatt 
gtttgcgtat tgggcgctct tccgcttcct 
ggctgcggcg agcggtatca gctcactcaa 
gggataacgc aggaaagaac atgtgagcaa 
aggccgcgtt gctggcgttt ttcgataggc 
gacgctcaag tcagaggtgg cgaaacccga 
ctggaagctc cctcgtgcgc tctcctgttc 



aaataagcag acagtgaaaa ggttttccgt 2640 
agagtcaggt gtaaacagtg aaaaataaag 2700 
acaatcggga gtcttctgca gctcgagcaa 27 60 
aaaggatttt tcgctacgct caaatccttt 2820 
gtggtcttta ttcttcaact aaagcaccca 2880 
gggatatttt taaaatatat atttatgtta 2940 
ttctaatgaa gaaagcagac aagtaagcct 3000 
ggcatatcaa atgaacttta ataaaattga 3060 
taatcattat ttgaaccaac aaacgacttt 3120 
tttataccga aacataaaac aagaaggata 3180 
gacaagggtg ataaactcaa atacagcttt 3240 
aggttattgg gataagttag agccacttta 3300 
ctctggtatt tggactcctg taaagaatga 3360 
tgatgtagag aaatataatg gttcggggaa 3420 
tgctttttct ctttctatta ttccatggac 3480 
taataatagt aattaccttc tacccattat 3540 
ttcaatatat ttaccgctat ctttacaggt 3600 
aggattgttt atgaactcta ttcaggaatt 3660 
atatgagata atgccgactg tactttttac 3720 
cgttagttga agaaggtttt tatattacag 3780 
aggagggtat gaaatgtgca tcatattgaa 3840 
cggttttggg gctggttctt aaaagaactt 3900 
ggcatcagct ggaagaaaga tcgtttttac 3960 
ctagagccgg aataccatag atgccgagtc 4020 
tccaagcttc aagtcgatca gatgactgaa 4080 
taccgagaca ggcatccttt tgccggagga 4140 
ccagaccgga ttaaggtaga gctcgttgcc 4200 

ctgttcagct ttttctcaat gcgatccagc 4260 
cctgtaacag ccgcaaagac gacagcggca 4320 
acatcaccta tattcatgtt cttcacctca 4380 
tttttattta aagcggcttt tccagacggg 4440 
gccgataggc gaacgctaaa aatggcaggc 4500 
aaatataaat cggccggcca gcgccaaaca 4560 
cgatgacgtt tccgagctgt ggaaaaccga 4620 
gttttgcaat ctggtctaca tacaaatcca 4680 
taaatgtgat gacaatcatc acaattgatg 4740 
tgctgatgct aaccatcaca ttaccgagca 4800 
caaaatggac aatgggaaat tgcgtcgcca 48 60 
gcccatacag cttcgttcca ttgccgatca 4920 
tggaactgaa ttgtgagtag ccgaaggtgc 4980 
aaagataatc cttttgcgtg accataaacg 5040 
ctggtgctga taaggatgga tgttttttaa 5100 
cgtaaatgac tgcaggcacc aaaaagggcg 5160 
caagcacagg tccgaagaca acgccgatat 5220 
gcagtctcgt tttttcttct gtcatatcag 5280 
gtgatttgca aagaccgttt aatgcgttta 5340 
ccgcaaagcc tgcaaatacc agcatccatc 5400 
tgccgaattt atctgagata tatccgccgt 5460 
tcgcggcgat gaccagccct gcataggaag 5520 
tcgctaaaaa aggaatgctc atcgatgttg 5580 
ttgtaacgcg ttggatgcat agcttgagta 5640 
aatcatggtc atagctgttt cctgtgtgaa 5700 
tacgagccgg aagcataaag tgtaaagcct 5760 
taattgcgtt gcgctcactg cccgctttcc 5820 
aatgaatcgg ccaacgcgcg gggagaggcg 5880 
cgctcactga ctcgctgcgc tcggtcgttc 5940 
aggcggtaat acggttatcc acagaatcag 6000 
aaggccagca aaaggccagg aaccgtaaaa 6060 
tccgcccccc tgacgagcat cacaaaaatc 6120 
caggactata aagataccag gcgtttcccc 6180 
cgaccctgcc gcttaccgga tacctgtccg 624 0 
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cctttctccc 

cggtgtaggt 

gctgcgcctt 

cactggcagc 

agttcttgaa 

ctctgctgaa 

ccaccgctgg 

gatctcaaga 

cacgttaagg 

attaaaaatg 

accaatgctt 

ttgcctgact 

gtgctgcaat 

agccagccgg 

ctattaattg 

ttgttggcat 

gctccggttc 

ttagctcctt 

tggttatggc 

tgactggtga 

cttgcccggc 

tcattggaaa 

gttcgatgta 

tttctgggtg 

ggaaatgttg 

attgtctcat 

cgcgcacatt 

aaggagaaaa 

taaatatttg 

ataaatcaaa 

cactattaaa 

gcccactacg 

taaatcggaa 

tggcgagaaa 

cggtcacgct 



ttcgggaagc 

cgttcgctcc 

atccggtaac 

agccactggt 

gtggtggcct 

gccagttacc 

tagcggtggt 

agatcctttg 

gattttggtc 

aagttttaaa 

aatcagtgag 

ccccgtcgtg 

gataccgcga 

aagggccgag 

ttgccgggaa 

tgctacaggc 

ccaacgatca 

cggtcctccg 

agcactgcat 

gtactcaacc 

gtcaatacgg 

acgttcttcg 

acccactcgt 

agcaaaaaca 

aatactcata 

gagcggatac 

tccccgaaaa 

taccgcatca 

ttaaatcagc 

agaatagacc 

gaacgtggac 

tgaaccatca 

ccctaaaggg 

ggaagggaag 

gcgcgtaacc 



gtggcgcttt 

aagctgggct 

tatcgtcttg 

aacaggatta 

aactacggct 

ttcggaaaaa 

ttttttgttt 

atcttttcta 

atgagattat 

tcaatctaaa 

gcacctatct 

tagataacta 

gacccacgct 

cgcagaagtg 

gctagagtaa 

atcgtggtgt 

aggcgagtta 

atcgttgtca 

aattctctta 

aagtcattct 

gataatagtg 

gggcgaaaac 

gcacccaact 

ggaaggcaaa 

ctcttccttt 

atatttgaat 

gtgccacctg 

ggcgaaattg 

tcatttttta 

gagatagggt 

tccaacgtca 

cccaaatcaa 

agcccccgat 

aaagcgaaag 

accacacccg 



ctcatagctc 

gtgtgcacga 

agtccaaccc 

gcagagcgag 

acactagaag 

gagttggtag 

gcaagcagca 

cggggtctga 

caaaaaggat 

gtatatatga 

cagcgatctg 

cgatacggga 

caccggctcc 

gtcctgcaac 

gtagttcgcc 

cacgctcgtc 

catgatcccc 

gaagtaagtt 

ctgtcatgcc 

gagaataccg 

tatgacatag 

tctcaaggat 

gatcttcagc 

atgccgcaaa 

ttcaatatta 

gtatttagaa 

tatgcggtgt 

taaacgttaa 

accaataggc 

tgagtgttgt 

aagggcgaaa 

gttttttgcg 

ttagagcttg 

gagcgggcgc 

ccgcgcttaa 



acgctgtagg 

accccccgtt 

ggtaagacac 

gtatgtaggc 

gacagtattt 

ctcttgatcc 

gattacgcgc 

cgctcagtgg 

cttcacctag 

gtaaacttgg 

tctatttcgt 

gggcttacca 

agatttatca 

tttatccgcc 

agttaatagt 

gtttggtatg 

catgttgtgc 

ggccgcagtg 

atccgtaaga 

cgcccggcga 

cagaacttta 

cttaccgctg 

atcttttact 

aaagggaata 

ttgaagcatt 

aaataaacaa 

gaaataccgc 

tattttgtta 

cgaaatcggc 

tccagtttgg 

aaccgtctat 

gtcgaggtgc 

acggggaaag 

tagggcgctg 



tatctcagtt 

cagcccgacc 

gacttatcgc 

ggtgctacag 

ggtatctgcg 

ggcaaacaaa 

agaaaaaaag 

aacgaaaact 

atccttttaa 

tctgacagtt 

tcatccatag 

tctggcccca 

gcaataaacc 

tccatccagt 

ttgcgcaacg 

gcttcattca 

aaaaaagcgg 

ttatcactca 

tgcttttctg 

ccgagttgct 

aaagtgctca 

ttgagatcca 

ttcaccagcg 

agggcgacac 

tatcagggtt 

ataggggttc 

acagatgcgt 

aaattcgcgt 

aaaatccctt 

aacaagagtc 

cagggcgatg 

cgtaaagctc 

ccggcgaacg 

gcaagtgtag 



6300 

6360 

6420 

6480 

6540 

6600 

6660 

6720 

6780 

6840 

6900 

6960 

7020 

7080 

7140 

7200 

7260 

7320 

7380 

7440 

7500 

7560 

7620 

7680 

7740 

7800 

7860 

7920 

7980 

8040 

8100 

8160 

8220 

8280 

8320 



<210> 70 
<211> 6688 
<212> DNA 

<213> Artificial Sequence 



<220> 



<223> Description of Artificial Sequence: plasmid, pAN336 



<400> 70 

tgcgccgcta 

tcggtgcggg 

ttaagttggg 

ttgtaatacg 

ggcgctgact 

aaaaaaatcg 

aagaagagcc 

cacaccgtta 

cctcggggcc 

acaaagccgg 

attatcttcg 

gcaagacgaa 

cagtctcagg 

gatcaaatcg 

cgtgcgaaac 

ctgaaagttg 



cagggcgcgt 
cctcttcgct 
taacgccagg 
actcactata 
tagaaaacct 
atgcgcgtga 
gcgttatctc 
tcggtctcgt 
aggctggcgg 
agctgcttga 
gtgaagtcag 
tggttacaga 
gcggtcaggt 
cttacgaaat 
aaatcctgac 
aaacgcttga 



ccattcgcca 
attacgccag 
gttttcccag 
gggcgaattg 
cttgaatgaa 
tattgacgaa 
caagaaagaa 
tttagatgag 
ttatgctgtt 
taaaattgtc 
cacaggggcg 
attcggtatg 
attcttaggc 
tgatcaggaa 
tgaaaatcgt 
cgctgaacaa 



ttcaggctgc 
ctggcgaaag 
tcacgacgtt 
ggcccgacgt 
gctgcgcttg 
gcgacggacc 
cgcaatatcg 
gcagatatgg 
atgctgccaa 
ggcctcttgg 
cacaatgact 
tcagaaaaac 
cgtgatttca 
attcagcgca 
gacaagcttg 
atcaaacacc 



gcaactgttg 
ggggatgtgc 
gtaaaacgac 
cgcatgcacc 
tagcggctcg 
gtgtaattgc 
tggcttatca 
ttcataaagt 
gagaagaccg 
gcggacgtgt 
tccagcgtgc 
tgggaccgtt 
acaacgaaca 
tcatcaaaga 
aattgattgc 
ttatcgatca 



ggaagggcga 
tgcaaggcga 
ggccagtgaa 
aggcttctca 
tcaaaacaag 
cggacccgct 
cgaaggcgga 
aacgattgtt 
ttatttccaa 
tgctgaagag 
gacgaatatt 
gcaatttgga 
gaactacagt 
atgttatgag 
ccaaacgctt 
tggaacatta 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 
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cctgagcgta atttctcaga 
aaaacagaag aaaagaaaga 
ccggctgacg ctggcagttt 



atccaattct 
atgtttgtaa 
ttatcgatgt 
aaataccgca 
gaggtatgaa 
aagctaccaa 
caatactgat 
gggggcaagg 
acttgcatgg 



ggtatcgatt 
aaagtaaagt 
ggggaacacc 
tcaggcgata 
aacgagaatt 
gacgaagagg 
aagataatat 
cataggcagc 
actaatgctt 



tgatgaaaag aacgatgatg tgaaagtaaa cattctgaca 
cgatacgaaa gagtaattcg ctttctttct aaaaaaactg 
atgattggct cagctgcggc ttttacaatc 
tgagccgctg atcgtgtatg gtattgtaga 
ttcaaaagtg gtgatagagg ttgttactgg 
agtgaaatac cgcacagatg cgtaaggaga 



ttttatgtaa 
tgtttacaaa 
agagaaacta 
ctgcagctcg 
aacccagcga 
ggacctttac 
atgaagagga 
atcttttata 
gcgcttatca 



accatttgag 
agaattactc 
tgaggaggca 
tagaagatat 
atatatctat 



ataattgtgg tttcaaaatc 



gaaacccagg acaataacct 
ggctccgtcg atactatgtt 



gtgataggta 
tatgaagcgc 
gattgccttg 
cgccgtatgt 
agaatgggca 
tatagcttgt 



agattatacc 
catatttaaa 
aatatattga 
aaggatttca 
aagcataaaa 
aaattct ate 



atacgccaac tttcaaaaca 



1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 



actttgaaaa agctgttttc tggtatttaa ggttttagaa tgcaaggaac agtgaattgg 1800 



gttataatta 

atggctaaaa 

aaagataegg 

ctatatttaa 

aaggacatga 

eggcatgatg 

gagtatgaag 

ctctttcact 

gecgaattgg 

gaagacactc 

gaagaggaac 

ggcaaagt aa 

attgecttet 

ttt'ettgact 

gatgaattgt 

gacatctaat 

ttctaaaagt 

ccattcaggc 

aggacccaag 

tatagacatc 



agttcgtctt 
gaaataataa 
ccgctgcgta 
aaatgaaaac 
ggaacgggaa 
gcactttgaa 
ttgctcggaa 
gtgeatcagg 
cagccgctta 
aaactgggaa 
ggaaaagece 
tgtgaaagat 
gtggtatgac 
tgtcgagcta 
tattttactg 
aagcttttta 
tttatatctt 
tcgccattcg 
caggcaaaac 
aatcagattg 
atgaaagaaa 
acttgegtat 
ggggcagaga 
agcttcactt 

gggcggaggt 

ctatgtctct 

acgtgcggtt 

cttcacagga 

tgtgtcatct 

taccattctt 

aatcacaaaa 

agggctgaca 

aacgatgect 

aggactaggc 

atgccatttt 

ccaaactacc 

gtttgaaatc 

acctaaatag 

tcacaattcc 

gagtgagcta 

tgtcgtgcca getgeattaa 
ggcgctcttc cgcttcctcg 



gcttcttggg 
tgagaatatc 
aaggaatgtc 
aaatgacgga 
tgctatggct 
gctggagcaa 

atgaacaaag ccctgaaaag 
ccatcgacat ateggattgt 
attacttact 
catttaaaga 
ttgtcttttc 
gtggctttat 
gcgtccggtc 
tactggggat 
tttagtacct 
cttttctgaa 
tegctagata 
tgcgcaactg 
gtcattgega 
gttgatccat 



gtatctttaa atactgtaga 
aceggaattg aaaaaactga 
tectgetaag gtatataagc 
cagceggtat aaagggacca 



ccgcgtagga agtgtatagg 
gaeggaaaag ttegggctta 



cgccatggta 
atgctcggcg 
ccgateggag 
aacccgcaag 
ggaacaaacg 
caagtagaaa 
gagcaggttc 
gcggcagggg 



cgtggccgac 
etatgetgaa 
ctattgtagc 
ttcattttga 
gaacgttaag 
tegtttcagg 
cttcatcagt 
gctttattat 



aaagaggaag 
tcgaaaaata 
tggtgggaga 
cctatgatgt 
caaaggtcct 
atggcgtcct 
tgtatgcgga 
atagcttaga 
tggattgega 
ttttaaagac 
gcaacatctt 
gggcggacaa 
aagaacagta 
taaaatat ca 
ctttaactac 
actctgatgt 
ggaatttgta 
ggtcgactgg 
attgegaacg 
gaattgattt 
tagtaaaagc 
tggtaaatga 
gtacaatgac 



attgaacagc gtctatctca 
ccagaagaaa ttttagaaga 



eggtatcage 
gaaagaacat 
tggcgttttt 
agaggtggcg 
tcgtgcgctc 



gtcagattcc attgeccttg 
aaaaaagaaa ttcaagatat 
tgtaatgaaa agtacttatt 
cgctaagctc tttagcgggt 
aagaacaatc tggacgcgtt 
cttggcgtaa tcatggtcat 
acacaacata egagceggaa 
actcacatta attgcgttgc 
tgaateggee 
ctcactgact 
tcactcaaag geggtaatae 
gtgagcaaaa ggccagcaaa 
cgataggctc cgcccccctg 
aaacccgaca ggactataaa gataccaggc 
tcctgttccg accctgccgc ttaceggata 



ggaaggaaag ctgcctgttc 
tetgetcatg agtgaggccg 
attatcgagc 
ccctatacga 
gaataacgat ctggccgatg 
tccgcgcgag ctgtatgatt 
ccacggcgac ctgggagaca 
tgatcttggg agaageggea 
gatcagggag gatategggg 
caagectgat tgggagaaaa 
agatttagat gtctaaaaag 
gtacatccgc aacrgtccat 
ggggtcccga gcgcctacga 
ttgggaaggg cgatcggtgc 
caggaggect ggcgccgctc 
tcttaaccct aaaagggctg 
aggtttagta atggattatt 
tgeagegaga acgactgata 
agcatccgct gcactaggcc 
gggegatgat aagctgaccg tgaaaatcga 
tgatgccaat gecaaaggag aagtcagagc 
tttaaatgaa caaggtaagc ttgatgtcag 
tgtcgtaaaa gatttaggtt tgcgcgagtt 
agaattagga gatgatttta cttactatct 
gggcgtaggt gtgctcgtaa atcctgacaa 
tcagctgatg ccgggaacag atgatgaaac 
agtagagccg atttctaagc tcatccaaaa 
agtcctaggc gagaaacctg agattttgga 
ttcaaaagaa cggttcgaaa cagecatttt 
gatagaagaa gatggacaag ccgaagcagt 
gagctggaag ggcttcgtga 
agaaaagggg ctgaaagcag 
cttgagtatt ctatagtgtc 
tgtgtgaaat tgttatccgc 
taaagcctgg ggtgcctaat 
gctcactgcc cgctttccag tegggaaace 
aacgegeggg gagaggeggt ttgcgtattg 
ggtcgttcgg ctgeggegag 
agaatcaggg gataaegcag 
ccgtaaaaag gccgcgttgc 
caaaaatcga cgctcaagtc 
gtttccccct ggaagctccc 
cctgtccgcc tttctccctt 



tacaaaagaa 
ttttaatttg 
ggatgeatag 
agctgtttcc 
gcataaagtg 



cgctgcgctc 
ggttatccac 
aggecaggaa 
acgagcatca 



1860 
1920 
1980 
2040 
2100 
2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
4020 
4080 
4140 
4200 
4260 
4320 
4380 
4440 
4500 
4560 
4620 



OGZ-001 



-52- 



cgggaagcgt 
ttcgctccaa 
ccggtaacta 
ccactggtaa 
ggtggcctaa 
cagttacctt 
gcggtggttt 
atcctttgat 
ttt tggtcat 
gtt ttaaatc 
tcagtgaggc 
ccgtcgtgta 
taccgcgaga 
gggccgagcg 
gccgggaagc 
ctacaggcat 
aacgatcaag 
gtcctccgat 
cactgcataa 
act caaccaa 
caatacggga 
gtt cttcggg 
ccactcgtgc 
caaaaacagg 
tactcatact 
gcggatacat 
cccgaaaagt 
ccgcatcagg 
aaatcagctc 
aatagaccga 
acgtggactc 
aaccatcacc 
ctaaagggag 
aagggaagaa 
gcgtaaccac 



ggcgctttct 
gctgggctgt 
tcgtcttgag 
caggattagc 
ctacggctac 
cggaaaaaga 
ttttgtttgc 
cttttctacg 
gagattatca 
aatctaaagt 
acctatctca 
gataactacg 
cccacgctca 
cagaagtggt 
tagagtaagt 
cgtggtgtca 
gcgagttaca 
cgttgtcaga 
ttctcttact 
gtcattctga 
taatagtgta 
gcgaaaactc 
acccaactga 
aaggcaaaat 
cttccttttt 
atttgaatgt 
gccacctgta 
cgaaattgta 
attttttaac 
gatagggttg 
caacgtaaaa 
caaatcaagt 
cccccgattt 
agcgaaagga 
cacacccgcc 



catagctcac 
gtgcacgaac 
tccaacccgg 
agagcgaggt 
actagaagga 
gttggtagct 
aagcagcaga 
gggtctgacg 
aaaaggatct 
atatatgagt 
gcgatctgtc 
atacgggagg 
ccggctccag 
cctgcaactt 
agttcgccag 
cgctcgtcgt 
tgatccccca 
agtaagttgg 
gtcatgccat 
gaataccgcg 
tgacatagca 
tcaaggatct 
tcttcagcat 
gccgcaaaaa 
caatattatt 
atttagaaaa 
tgcggtgtga 
aacgttaata 
caataggccg 
agtgttgttc 
gggcgaaaaa 
tttttgcggt 
agagcttgac 
gcgggcgcta 
gcgcttaa 



gctgtaggta 
cccccgttca 
taagacacga 
atgtaggcgg 
cagtatttgg 
cttgatccgg 
ttacgcgcag 
ctcagtggaa 
tcacctagat 
aaacttggtc 
tatttcgttc 
gcttaccatc 
atttatcagc 
tatccgcctc 
ttaatagttt 
ttggtatggc 
tgttgtgcaa 
ccgcagtgtt 
ccgtaagatg 
cccggcgacc 
gaactttaaa 
taccgctgtt 
cttttacttt 
agggaataag 
gaagcattta 
ataaacaaat 
aataccgcac 
ttttgttaaa 
aaatcggcaa 
cagtttggaa 
ccgtctatca 
cgaggtgccg 
ggggaaagcc 
gggcgctggc 



tctcagttcg 
gcccgaccgc 
cttatcgcca 
tgctacagag 
tatctgcgct 
caaacaaacc 
aaaaaaagga 
cgaaaactca 
ccttttaaat 
tgacagttac 
atccatagtt 
tggccccagt 
aataaaccag 
catccagtct 
gcgcaacgtt 
ttcattcagc 
aaaagcggtt 
atcactcatg 
cttttctgtg 
gagttgctct 
agtgctcatc 
gagatccagt 
caccagcgtt 
ggcgacacgg 
tcagggttat 
aggggttccg 
agatgcgtaa 
attcgcgtta 
aatcccttat 
caagagtcca 
gggcgatggc 
taaagctcta 
ggcgaacgtg 
aagtgtagcg 



gtgtaggtcg 
tgcgccttat 
ctggcagcag 
ttcttgaagt 
ctgctgaagc 
accgctggta 
tctcaagaag 
cgttaaggga 
taaaaatgaa 
caatgcttaa 
gcctgactcc 
gctgcaatga 
ccagccggaa 
attaattgtt 
gttggcattg 
tccggttccc 
agctccttcg 
gttatggcag 
actggtgagt 
tgcccggcgt 
attggaaaac 
tcgatgtaac 
tctgggtgag 
aaatgttgaa 
tgtctcatga 
cgcacatttc 
ggagaaaata 
aatatttgtt 
aaatcaaaag 
ctattaaaga 
ccactacgtg 
aatcggaacc 
gcgagaaagg 
gtcacgctgc 



4680 
4740 
4800 
4860 
4920 
4980 
5040 
5100 
5160 
5220 
5280 
5340 
5400 
5460 
5520 
5580 
5640 
5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6688 



<210> 71 
<211> 9396 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial 
<400> 71 

ttgcggccgc ttcgaactgt tataaaaaaa 
atcccttaac gatttagaaa tccctttgag 
aactaatgac aatgattcct gaaaaaagta 
gatcaacttc cataggtaac aacctttgat 
caattgcaat acctgttccc tctgataaaa 
caccagcttc ctgctgtttc aagctacttg 
acgaaagccc acctaaaaca aatacgatta 
aaagaaagga agcagttaaa aagctaacag 
ataaaggacc tcttctatca acaagtatcc 
ttgttccagg gaaaataatt acacttccga 
tcatcatata aggaaccata gagacaaacc 
aaagaactcc aatcataaaa ggtatatttt 
ttactttcct gatatgtttt acaaatatca 
aaatgctata tgatgttgta aacaacataa 
ttcctttgat atcaaaatga ccttttatcc 
acggaacagt gataattgtt atcataggaa 
gggctatcat tccaccaatc gctggaccga 



Sequence: plasmid, pAN341 and pAN342 



ggatcaattt 
aatgtttata 
ataacaaatt 
caagtaaggg 
agctggtaaa 
aaacaattgt 
taattgtcat 
aaagaaatgt 
caccaatgta 
tttcggcagt 
ctgctactgt 
tccctaatcc 
ggaatgacag 
aaaatacaat 
ttacttcttt 
tgagtagaag 
ctccttctcc 



tgaactctct 
tacattcaag 
actatacaga 
tatggataat 
gttaagcaaa 
tgatataact 
gaaccatgat 
aactccgatg 
gccgaaaata 
acttagctgg 
tccaaatata 
gggatcaaca 
cacgctaacg 
gcctacagac 
ctttaataat 
ataggaccaa 
catggctact 



cccaaagttg 
gtaaccagcc 
taagttgact 
aaaccaccta 
ctcattccag 
gttttggtga 
gttgtttcta 
tttaacacgt 
atgacactca 
tgaacatctt 
attcccccac 
aaaggatctg 
ataagaaaag 
attagtataa 
ttcataagaa 
tgaatataat 
atcgatccaa 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 
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taagaccaaa tgctttaccc ctattttcct 
cgagtgctgg aaatgcagct gcaccagccc 
aaaagaaaga atggccaaca aacccaatta 
ataggagtaa ccttttgatg cctaattgat 
tggaaaaggt taacataaag gctgtgttca 
aatcatttgc aatatcaggt aatgagacgt 
aagataaaat gcaaagccaa attaaaattt 
atgtattcac atttcaccct ccaataatga 
cgcttccctc ttttaattga accctgttac 
aaacttgatt aaaacatttt accacatata 
ttatacaaca atatggcccg tttgttgaac 
caattccaca ttgcaataat agaaaatcca 
tgaatcaaat cgccttcttc tgtgtcatca 
aaaccaccat aggagattaa ccttttacgg 
ttcaaattct tttcttcatc atcggtcata 
gtttcgtcaa ttgccgattg tatatccgat 
acttttacat ttggatcata gtctaatttc 
ttttgattca cgtagttttc tgtattctta 
tgcatgtgct gattataaga attatcttta 
aaaccaacaa gatttttatt aattttttta 
atatctgaca aactcttatt taattcttcg 
agaaacaacc aacgaactgt tggcttttgt 
tgaatgccat gtttcattgc tctcctccag 
aaccacactc gatacaactt tctttcgcct 
tcagcacaat cttttactct ttcagccttt 
taatcaacat tagcgatttt cttttctctc 
gttgcaaacg attctggata gtttgccgga 
aagagatccg tattccccgc tgaaaactca 
accatgtctc tgtaccattc atcaagtcca 
tcaggagcca tttccgggtg gggatttcct 
ccagcttcca aaatgtcagc atcagtgccg 
cattttgcgc caggtgctac accgatttga 
gtgcctgtca cgtgggttcc atgagccaaa 
gcatcatacc agttcatttc attrtcaggc 
tatttctctt ttaatgccgg atgattccat 
gtgccagttc catcatatcc aagtgcccaa 
cattccacac cgtcagtrgc tttaatagct 
gatgacttaa aaagctgccg tttctcatta 
accacttttt ccaraacctc ttttgaggca 
gaatgaattt ggtctgcatt tcctttatct 
tgctgggatt catcggctgt gacttttaaa 
tccgtcttag cggcagacag cttcttcgat 
ttttctgggt acccctatgt attactagaa 
ttcaatggag gtgatgtccg tttttcatta 
ctcttggacc cgggatccgc aagtgctttt 
cacttcctac gcggtttctt tcataaatca 
caacgatgtc tatacaatct gattcgttcg 
tgaccttggg tcctgttttg cctgccattt 
gacatagcca aataaaattc cagattgcat 
gtcgggccgg gtgatttcga tacgaggaag 
cgaaattgta atcccagggg caatcgcccc 
gcagtacgtt gtggcggttc cgaaatcgac 
cgcagcgaca gcatttacga ttctgtctgc 
atttaaaccg gttttcatac ctggaccaac 
gcacattctt tctaacgcaa acatgattgg 
tatctgttca aacataagcc cggagtgatc 
atcttctgtt ttatgcctgc ttgtttctat 
atatacacca agtacagtat tggtgttccc 
ccacttttga atagtttctc tagaacaggc 
acacaccctg tagggaaagt caataccttt 
tctagggaca cctctttgta cccctcaagg 



ttggaatata gcgcgcaact acaaccatta 1080 
cttgaataaa acgagccata ataagtaagg 1140 
ccgacccgaa acaatttatt ataattccaa 1200 
cagatagctt tccatataca gctgttccaa 1260 
cccagtttgt actcgcaggt ggtttattaa 1320 
tcaaaaccat ttcatttaat acgctaaaaa 1380 
ggttgtgtcg taaattcgat tgtgaatagg 1440 
gggcagacgt agtttatagg gttaatgata 1500 
attcattaca cttcataatt aattcctcct 1560 
aactaagttt taaattcagt atttcatcac 1620 
tactctttaa taaaataatt tttccgttcc 1680 
tcttcatcgg ctttttcgtc atcatctgta 1740 
aggtttaatt ttttatgtat ttcttttaac 1800 
tgtaaacctt cctccaaatc agacaaacgt 1860 
aaatccgtat cctttacagg atattttgca 1920 
ttatatttat ttttcggtcg aatcatttga 1980 
attgcctttt tccaaaattg aatccattgt 2040 
aaataagttg gttccacaca taccaataca 2100 
ttatttattg tcacttccgt tgcacgcata 2160 
tattgcatca ttcggcgaaa tccttgagcc 2220 
ccatcataaa catttttaac tgttaatgtg 2280 
ttaataactt cagcaacaac cttttgtgac 2340 
ttgcacattg gacaaagcct ggatttacaa 2400 
gtttcacgat tttgtttata ctctaatatt 2460 
ttaaattcaa gaatatgcag aagttcaaag 2520 
catggggaat tggaattctc agtcgctcca 2580 
tttgcgatag aaccaggccc gccgggaata 2640 
gggaaaatat cggccgaacg ccaggcattg 2700 
gagccccctc cccatgagtt attgacaaca 2760 
tccgcgtcct ttggtgctaa aacccattca 2820 
ccatcttcag agaacgcttt aacagcaatc 2880 
tttgttccat caggttcaga gcccaccatc 2940 
tcatcataag ggcttgcctc gcctgctacg 3000 
tcattaggat tttccggatt atatccgcga 3060 
tccaccccgg tatcaatgga cgcaacaacc 3120 
gcttttgggg catcgatttg gtctacattc 3180 
ttctgtgctt ttttcatatt aaatggggag 3240 
ggaagcacct tttccacttc gggaaactgc 3300 
tgaacagcaa tcccgttcac cacataataa 3360 
ttctgggtgt tcaagtattt taggacatct 3420 
gatgacacaa cagcagaacg cttttgatar 3480 
ttcgcttttt taacagccgc ttttgccgct 3540 
aataacatag taaaacggac atcactccgt 3600 
caacaaatta cttatctatt tgtaatgctg 3660 
actaaataat ccattactaa acctcctata 3720 
attccagccc ttttagggtt aagaatggat 3780 
caatgagcgg cgccaggcct cctgtcgcaa 3840 
cattcgctta acgattcctt ccacttggcc 3900 
cgcgctaaca gtgttttttc cgataatatt 3960 
ctttgctgca cgcgagtaaa gcgcctctgt 4020 
gcccatgtat tgtttgtttt catcaatata 4080 
aacaattaat ggattgccgt acaagtgtat 4140 
ccctacttct ttcggattgt catattttat 4200 
aatttgaggc tcgatatgaa agtattttgt 4260 
cggcactact gacgaaataa taatgccatc 4320 
aaataaggag cgcaaaatca tcccaaactc 4380 
acgccagtga tattctaatt ttccatcatg 4440 
cacatcgata accagtaaca acctctatca 4500 
ggggttgccc ccgcctgtaa ttaaattatt 4560 
ttgtaaaatt tttacacagc gtggatctct 4 620 
gagaaatatt ggcggtactg agcacagttt 4 680 
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tggttggtgg acagtgaacc atagctgtcg 
aaaggaaaca aaaagaccgt tttcacacaa 
acagccactt ttgcaaaaac cggacagctt 
agcttcgcga agcggccgca aaattcactg 
aaccctggcg ttacccaact taatcgcctt 
aatagcgaag aggcccgcac cgatcgccct 
tggcgcctga tgcggtattt tctccttacg 
tgcactctca gtacaatctg ctctgatgcc 
acacccgctg actatgcttg taaaccgttt 
acctctaggg tccccaatta attagtaata 
ccaccggatc agcttagtaa agccctcgct 
gccaactatt gcgataacaa gaaaaagcca 
agggcttatt atgcacgctt aaaaataata 
gagcaattat gtgcttagtg catctaacgc 
ggcttgaacg aattgttaga cattatttgc 
agtggacaaa ttcttccaac tgatctgcgc 
gataagcctg tctagcttca agtatgacgg 
cccagtcggc agcgacatcc ttcggcgcga 
gggacaacgt aagcactaca tttcgctcat 
gcgttaaggt ttcatttagc gcctcaaata 
cctccgccgc tggacctacc aaggcaacgc 
ccagatcaat gtcgatcgtg gctggctcga 
attctccaaa ttgcagttcg cgcttagctg 
caacaatggt gacttctaca gcgcggagaa 
ccaaaaggtc gttgatcaaa gctcgccgcg 
ccagcaaatc aatatcactg tgtggcttca 
gtacggccag caacgtcggt tcgagatggc 
gagtcgatac ttcggcgatc accgcttccc 
tgccctgctg cgtaacatcg ttgctgctcc 
gcgcttgctg cttggatgcc cgaggcatag 
ccatgaaaac cgccactgcg ccgttaccac 
tgcgtgagcg catacgctac ttgcattaca 
gggttcgtgc cttcatccgt ttccacggtg 
aagtcgaggc atttctgtcc tggctggcga 
gtcaggcatt ggcggccttg ctgttcttct 
ggcttcagga gatcggaaga cctcggccgt 
atgaagtggt tcgcatcctc ggtttt ctgg 
tgtatggaac gggcatgcgg atcagtgagg 
tcgatcacgg cacgatcatc gtgcgggagg 
tacccgagag cttggcaccc agcctgcgcg 
ttgaatgacc tttaatagat tatattacta 
ttattttaaa aattttttca caaaacggtt 
acgggctgtt ctggtgttgc tagtttgtta 
cggctgaaag cgctatttct tccagaattg 
tggctcccgt gttgtcggca gctttgattc 
tatgtgtgac tgttgagctg taacaagttg 
ctttgtttta ctggtttcac ctgttctatt 
gtcgatctgt tcatggtgaa cagctttaaa 
atctatcttt tttacaccgt tttcatctgt 
acggtgaaca gttgttctac ttttgtttgt 
agccataaga acctcagatc cttccgtatt 
gtttttgcgt gagccatgag aacgaaccat 
aaaattttgc ctcaaaactg gtgagctgaa 
ttcttagtcc gttacgtagg taggaatctg 
tcatttttat ctggttgttc tcaagttcgg 
cttggaaaat caacgtatca gtcgggcggc 
tgtaagtgtt taaatcttta cttattggtt 
catggtagtt attttcaagc attaacatga 
ttgccttgtg agttttcttt tgtgttagtt 
agtatttgtt ttcaaaagac ttaacatgtt 
ggaaaagata aggcaatatc tcttcactaa 



tcaatagcct cgagttatgg cagttggtta 4740 
aacggtcttt ttcgatttct ttttacagtc 4800 
catgccttat aactgctgtt tcggtcgaca 4860 
gccgtcgttt tacaacgtcg tgactgggaa 4920 
gcagcacatc cccctttcgc cagctggcgt 4980 
tcccaacagt tgcgcagcct gaatggcgaa 5040 
catctgtgcg gtatttcaca ccgcatatgg 5100 
gcatagttaa gccagccccg acacccgcca 5160 
tgtgaaaaaa tttttaaaat aaaaaagggg 5220 
taatctatta aaggtcattc aaaaggtcat 5280 
agattttaat gcggatgttg cgattacttc 5340 
gcctttcatg atatatctcc caatttgtgt 5400 
aaagcagact tgacctgata gtttggctgt 54 60 
ttgagttaag ccgcgccgcg aagcggcgtc 5520 
cgactacctt ggtgatctcg cctttcacgt 5580 
gcgaggccaa gcgatcttct tcttgtccaa 5640 
gctgatactg ggccggcagg cgctccattg 5700 
ttttgccggt tactgcgctg taccaaatgc 57 60 
cgccagccca gtcgggcggc gagttccata 5820 
gatcctgttc aggaaccgga tcaaagagtt 5880 
tatgttctct tgcttttgtc agcaagatag 5940 
agatacctgc aagaatgtca ttgcgctgcc 6000 
gataacgcca cggaatgatg tcgtcgtgca 6060 
tctcgctctc tccaggggaa gccgaagttt 6120 
ttgtttcatc aagccttacg gtcaccgtaa 6180 
ggccgccatc cactgcggag ccgracaaat 6240 
gctcgatgac gccaactacc tctgatagtt 6300 
tcatgatgtt taactttgtt ttagggcgac 6360 
ataacatcaa acatcgaccc acggcgtaac 6420 
actgtacccc aaaaaaacag tcataacaag 6480 
cgctgcgttc ggtcaaggtt ctggaccagt 6540 
gcttacgaac cgaacaggct taigtccact 6600 
tgcgtcaccc ggcaaccttg ggcagcagcg 6660 
acgagcgcaa ggtttcggtc tccacgcatc 6720 
acggcaaggt gctgtgcacg gatctgccct 6780 
cgcggcgctt gccggtggtg ctgaccccgg 684 0 
aaggcgagca tcgtttgttc gcccagcttc 6900 
gtttgcaact gcgggtcaag gatctggatt 6960 
gcaagggctc caaggatcgg gccttgatgt 7020 
agcaggggaa ttgatccggt ggatgacctt 7080 
attaattggg gaccctagag gtcccctttt 7140 
tacaagcata acgggttttg ctgcccgcaa 7200 
tcagaatcgc agatccggct tcaggtttgc 7260 
ccatgatttt ttccccacgg gaggcgtcac 7320 
gataagcagc atcgcctgtt tcaggctgtc 7380 
tctcaggtgt tcaatttcat gttctagttg 7440 
aggtgttaca tgctgttcat ctgttacatt 7500 
tgcaccaaaa actcgtaaaa gctctgatgt 7560 
gcatatggac agttttccct ttgatatcta 7620 
tagtcttgat gcttcactga tagatacaag 7680 
tagccagtat gttctctagt gtggttcgtt 7740 
tgagatcatg cttactttgc atgtcactca 7800 
tttttgcagt taaagcatcg tgtagtgttt 7860 
atgtaatggt tgttggtatt ttgtcaccat 7920 
ttacgagatc catttgtcta tctagttcaa 7980 
ctcgcttatc aaccaccaat ttcatattgc 8040 
tcaaaaccca ttggttaagc cttttaaact 8100 
acttaaattc atcaaggcta atctctatat 8160 
cttttaataa ccactcataa atcctcatag 8220 
ccagattata ttttatgaat ttttttaact 8280 
aaactaattc taatttttcg cttgagaact 8340 
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tggcatagtt 
cagttctcgt 
tgatgttcat 
tagggttttc 
gctccgttaa 
acatacatct 
aatgataatt 
ttgctggaaa 
ttttttttgt 
taaaaagaat 
acaaaaggat 
gcttaagtag 
atcaggcacc 
gcagtgaatg 
cccataatac 
tgtggtgcta 
ccacttcgga 
ggtatcatca 



tgtccactgg 
catcagctct 
catctgagcg 
aatcgtgggg 
gtcatagcga 
caattggtct 
actagtcctt 
acttgtaaat 
ttatattcaa 
agatcccagc 
gtcgcaaacg 
caccctcgca 
tgagtcgctg 
ggggtaaatg 
aagaaaagcc 
tctgactttt 
ttatcccgtg 
acaggcttac 



aaaatctcaa 
ctggttgctt 
tattggttat 
ttgagtagtg 
ctaatcgcta 
aggtgatttt 
ttcctttgag 
tctgctagac 
gtggttataa 
cctgtgtata 
ctgtttgctc 
agctcgggca 
tctttttcgt 
gcactacagg 
cgtcacgggc 
tgctgttcag 
acaggtcatt 
ccgtcttact 



agcctttaac 
tagctaatac 
aagtgaacga 
ccacacagca 
gttcatttgc 
aatcactata 
ttgtgggtat 
cctctgtaaa 
tttatagaat 
actcactact 
ctctacaaaa 
aatcgctgaa 
gacattcagt 
cgccttttat 
ttctcagggc 
cagttcctgc 
cagactggct 
gtcaac 



caaaggattc 
accataagca 
taccgtccgt 
taaaattagc 
tttgaaaaca 
ccaattgaga 
ctgtaaattc 
ttccgctaga 
aaagaaagaa 
ttagtcagtt 
cagaccttaa 
tattcctttt 
tcgctgcgct 
ggattcatgc 
gttttatggc 
cctctgattt 
aatgcaccca 



ctgatttcca 
ttttccctac 
tctttccttg 
ttggtttcat 
actaattcag 
tgggctagtc 
tgctagacct 
cctttgtgtg 
taaaaaaaga 
ccgcagtatt 
aaccctaaag 
gtctccgacc 
cacggctctg 
aaggaaacta 
gggtctgcta 
tccagtctga 
gtaaggcagc 



8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9300 
9360 
9396 
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<220> 

<223> Description of Artificial Sequence: plasmid, pAN329 and pAN330 



<400> 72 

ttgcggccgc 

atcccttaac 

aactaatgac 

gatcaacttc 

caattgcaat 

caccagcttc 

acgaaagccc 

aaagaaagga 

ataaaggacc 

ttgttccagg 

tcatcatata 

aaagaactcc 

ttactttcct 

aaatgctata 

ttcctttgat 

acggaacagt 

gggctatcat 

taagaccaaa 

cgagtgctgg 

aaaagaaaga 

ataggagtaa 

tggaaaaggt 

aat catttgc 

aagataaaat 

atgtattcac 

cgcttccctc 

aaacttgatt 

ttatacaaca 

caattccaca 

tgaatcaaat 

aaaccaccat 

ttcaaattct 

gtttcgtcaa 



ttcgaactgt 
gatttagaaa 
aatgattcct 
cataggtaac 
acctgttccc 
ctgctgtttc 
acctaaaaca 
agcagttaaa 
tcttctatca 
gaaaataatt 
aggaaccata 
aatcataaaa 
gatatgtttt 
tgatgttgta 
atcaaaatga 
gataattgtt 
tccaccaatc 
tgctttaccc 
aaatgcagct 
atggccaaca 
ccttttgatg 
taacataaag 
aatatcaggt 
gcaaagccaa 
atttcaccct 
ttttaattga 
aaaacatttt 
atatggcccg 
ttgcaataat 
cgccttcttc 
aggagattaa 
tttcttcatc 
ttgccgattg 



tataaaaaaa 
tccctttgag 
gaaaaaagta 
aacctttgat 
tctgataaaa 
aagctacttg 
aatacgatta 
aagctaacag 
acaagtatcc 
acacttccga 
gagacaaacc 
ggtatatttt 
acaaatatca 
aacaacataa 
ccttttatcc 
atcataggaa 
gctggaccga 
ctattttcct 
gcaccagccc 
aacccaatta 
cctaattgat 
gctgtgttca 
aatgagacgt 
attaaaattt 
ccaataatga 
accctgttac 
accacatata 
tttgttgaac 
agaaaatcca 
tgtgtcatca 
ccttttacgg 
atcggtcata 
tatatccgat 



ggat caattt 
aatgtttata 
ataacaaatt 
caagtaaggg 
agctggtaaa 
aaacaattgt 
taattgtcat 
aaagaaatgt 
caccaatgta 
tttcggcagt 
ctgctactgt 
tccctaatcc 
ggaatgacag 
aaaatacaat 
ttacttcttt 
tgagtagaag 
ctccttctcc 
ttggaatata 
cttgaataaa 
ccgacccgaa 
cagatagctt 
cccagtttgt 
tcaaaaccat 
ggttgtgtcg 
gggcagacgt 
attcattaca 
aactaagttt 
tactctttaa 
tcttcatcgg 
aggtttaatt 
tgtaaacctt 
aaatccgtat 
ttatatttat 



tgaactctct 
tacattcaag 
actatacaga 
ta tggataat 
gttaagcaaa 
tgatataact 
gaaccatgat 
aactccgatg 
gccgaaaata 
acttagctgg 
tccaaatata 
gggatcaaca 
cacgctaacg 
gcctacagac 
ctttaataat 
ataggaccaa 
catggctact 
gcgcgcaact 
acgagccata 
acaatttatt 
tccatataca 
actcgcaggt 
ttcatttaat 
taaattcgat 
agtttatagg 
cttcataatt 
taaattcagt 
taaaataatt 
ctttttcgtc 
ttttatgtat 
cctccaaatc 
cctttacagg 
ttttcggtcg 



cccaaagttg 
gtaaccagcc 
taagttgact 
aaaccaccta 
ctcatt ccag 
gttttggtga 
gttgtttcta 
tttaacacgt 
atgacactca 
tgaacatctt 
attcccccac 
aaaggatctg 
ataagaaaag 
attagtataa 
ttcataagaa 
tgaatataat 
atcgatccaa 
acaaccatta 
ataagtaagg 
ataattccaa 
gctgttccaa 
ggtttattaa 
acgctaaaaa 
tgtgaatagg 
gttaatgata 
aattcctcct 
atttcatcac 
tttccgttcc 
atcatctgta 
ttcttttaac 
agacaaacgt 
atattttgca 
aatcatttga 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 
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acttttacat ttggatcata gtctaatttc 
ttttgattca cgtagttttc tgtattctta 
tgcatgtgct gattataaga attatcttta 
aaaccaacaa gatttttatt aattttttta 
atatctgaca aactcttatt taattcttcg 
agaaacaacc aacgaactgt tggcttttgt 
tgaatgccat gtttcattgc tctcctccag 
aaccacactc gatacaactt tctttcgcct 
tcagcacaat cttttactct ttcagccttt 
taatcaacat tagcgatttt cttttctctc 
gttgcaaacg attctggata gtttgccgga 
aagagatccg tattccccgc tgaaaactca 
accatgtctc tgtaccattc atcaagtcca 
tcaggagcca tttccgggtg gggatttcct 
ccagcttcca aaatgtcagc atcagtgccg 
cattttgcgc caggtgctac accgatttga 
gtgcctgtca cgtgggttcc atgagccaaa 
gcatcatacc agt tcatttc attttcaggc 
tatttctctt ttaatgccgg atgattccat 
gtgccagttc catcatatcc aagtgcccaa 
cattccacac cgtcagttgc tttaatagct 
gatgacttaa aaagctgccg tttctcatta 
accacttttt ccataacctc ttttgaggca 
gaatgaattt ggtctgcatt tcctttatct 
tgctgggatt catcggctgt gacttttaaa 
tccgtcttag cggcagacag cttcttcgat 
ttttctgggt acccctatgt attactagaa 
ttcaatggag gtgatgtccg tttttcatta 
ctcttggacc cgggatccac gatgtctata 
aggcctcctg tcgcaatgac cttgggtcct 
ttccttccac ttggccgaca tagccaaata 
ttttrccgat aatattgtcg ggccgggtga 
agtaaagcgc ctctgtcgaa attgtaatcc 
tgttttcatc aatatagcag tacgttgtgg 
tgccgtacaa gtgtatcgca gcgacagcat 
gattgtcata ttttatattt aaaccggttt 
tatgaaagta ttttgtgcac attctttcta 
aaataataat gccatctatc tgttcaaaca 
aaatcatccc aaactcatct tctgttttat 
ctaattttcc atcatgatat acaccaagta 
gtaacaacct ctatcaccac ttttgaatag 
ctgtaattaa attattacac accctgtagg 
cacagcgtgg atctcttcta gggacacctc 
gtactgagca cagttttggt tggtggacag 
ttatggcagt tggttaaaag gaaacaaaaa 
atttcttttt acagtcacag ccacttttgc 
gctgtttcgg tcgacaagct tcgcgaagcg 
acgtcgtgac tgggaaaacc ctggcgttac 
tttcgccagc tggcgtaata gcgaagaggc 
cagcctgaat ggcgaatggc gcctgatgcg 
ttcacaccgc atatggtgca ctctcagtac 
gccccgacac ccgccaacac ccgctgacta 
taaaataaaa aaggggacct ctagggtccc 
tcattcaaaa ggtcatccac cggatcagct 
atgttgcgat tacttcgcca actattgcga 
atctcccaat ttgtgtaggg cttattatgc 
ctgatagttt ggctgtgagc aattatgtgc 
gccgcgaagc ggcgtcggct tgaacgaatt 
atctcgcctt tcacgtagtg gacaaattct 
tcttcttctt gtccaagata agcctgtcta 
ggcaggcgct ccattgccca gtcggcagcg 



attgcctttt tccaaaattg aatccattgt 2040 

aaataagttg gttccacaca taccaataca 2100 

ttatttattg tcacttccgt tgcacgcata 2160 

tattgcatca ttcggcgaaa tccttgagcc 2220 

ccatcataaa catttttaac tgttaatgtg 2280 

ttaataactt cagcaacaac cttttgtgac 2340 

ttgcacattg gacaaagcct ggatttacaa 2400 

gtttcacgat tttgtttata ctctaatatt 2460 

ttaaattcaa gaatatgcag aagttcaaag 2520 

catggggaat tggaattctc agtcgctcca 2580 

tttgcgatag aaccaggccc gccgggaata 2 64 0 

gggaaaatat cggccgaacg ccaggcattg 2700 

gagccccctc cccatgagtt attgacaaca 27 60 

tccgcgtcct ttggtgctaa aacccattca 2820 

ccatcttcag agaacgcttt aacagcaatc 2880 

tttgttccat caggttcaga gcccaccatc 2940 

tcatcataag ggcttgcctc gcctgctacg 3000 

tcattaggat tttccggatt atatccgcga 3060 

tccaccccgg tatcaatgga cgcaacaacc 3120 

gcttttgggg catcgatttg gtctacattc 3180 

ttctgtgctt ttttcatatt aaatggggag 3240 

ggaagcacct tttccacttc gggaaactgc 3300 

tgaacagcaa tcccgttcac cacataataa 3360 

ttctgggtgt tcaagtattt taggacatct 3420 

gatgacacaa cagcagaacg cttttgatat 3480 

ttcgcttttt taacagccgc ttttgccgct 3540 

aataacatag taaaacggac atcactccgt 3600 

caacaaatta cttatctatt tgtaatgctg 3660 

caatctgatt cgttcgcaat gagcggcgcc 3720 

gttctgcctg ccatttcatt cgcttaacga 3780 

aaattccaga ttgcatcgcg ctaacagtgt 3840 

tttcgatacg aggaagcttt gctgcacgcg 3900 

caggggcaat cgccccgccc atgtattgtt 3960 

cggttccgaa atcgacaaca attaatggat 4020 

ttacgattct gtctgcccct acttcttt eg 4080 

tcatacctgg accaacaatt tgaggctcga 4140 

aegcaaacat gattggegge actactgacg 4200 

taagcccgga gtgatcaaat aaggagegea 4260 

gectgettgt ttctataege cagtgatatt 4320 

cagtattggt gttccccaca tcgataacca 4380 

tttctctaga acaggegggg ttgcccccgc 4440 

gaaagtcaat acctttttgt aaaattttta 4500 

tttgtacccc tcaagggaga aatattggcg 4560 

tgaaccatag ctgtcgtcaa tagectcgag 4 620 

gaeegtttte acacaaaacg gtctttttcg 4 680 

aaaaaccgga cagcttcatg ccttataact 4740 

geegcaaaat tcactggccg tegttttaca 4800 

ccaacttaat cgccttgcag cacatccccc 4860 

ccgcaccgat cgcccttccc aacagttgcg 4 920 

gtattttctc cttacgcatc tgtgcggtat 4980 

aatctgetet gatgeegcat agttaagcca 5040 

tgcttgtaaa ccgttttgtg aaaaaatttt 5100 

caattaatta gtaatataat ctattaaagg 5160 

tagtaaagee ctegctagat tttaatgegg 5220 

taacaagaaa aagccagcct ttcatgatat 5280 

aegcttaaaa ataataaaag cagacttgac 5340 

ttagtgcatc taacgcttga gttaagcege 5400 

gttagacatt atttgecgae taccttggtg 5460 

tccaactgat ctgcgcgcga ggecaagega 5520 

gcttcaagta tgacgggctg atactgggee 5580 

acatccttcg gcgcgatttt geeggttact 5640 
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gcgctgtacc 
ggcggcgagt 
accggatcaa 
tttgtcagca 
atgtcattgc 
atgatgtcgt 
ggggaagccg 
cttacggtca 
gcggagccgt 
actacctctg 
tttgttttag 
cgacccacgg 
aaacagtcat 
aaggttctgg 
caggcttatg 
accttgggca 
tcggtctcca 
tgcacggatc 
gtggtgctga 
ttgttcgccc 
gtcaaggatc 
gatcgggcct 
tccggtggat 
ctagaggtcc 
gttttgctgc 
.ccggcttcag 
ccacgggagg 
cctgtttcag 
tttcatgttc 
gttcatctgt 
gtaaaagctc 
ttccctttga 
cactgataga 
tctagtgtgg 
ctttgcatgt 
gcatcgtgta 
ggtatt ttgt 
tgtctatcta 
accaatttca 
ttaagccttt 
aggctaatct 
tcataaatcc 
atgaattttt 
ttttcgcttg 
ggattcctga 
taagcatttt 
gtccgttctt 
attagcttgg 
aaaacaacta 
ttgagatggg 
aaattctgct 
gctagacctt 
aaagaataaa 
tcagttccgc 
ccttaaaacc 
ccttttgtct 
tgcgctcacg 
tcatgcaagg 
tatggcgggt 
tgattttcca 
cacccagtaa 



aaatgcggga 
tccatagcgt 
agagttcctc 
agatagccag 
gctgccattc 
cgtgcacaac 
aagtttccaa 
ccgtaaccag 
acaaatgtac 
atagttgagt 
ggcgactgcc 
cgtaacgcgc 
aacaagccat 
accagttgcg 
tccactgggt 
gcagcgaagt 
cgcatcgtca 
tgccctggct 
ccccggatga 
agcttctgta 
tggatttcga 
tgatgttacc 
gaccttttga 
ccttttttat 
ccgcaaacgg 
gtttgccggc 
cgtcactggc 
gctgtctatg 
tagttgcttt 
tacattgtcg 
tgatgtatct 
tatctaacgg 
tacaagagcc 
ttcgttgttt 
cactcaaaaa 
gtgtttttct 
caccattcat 
gctcaacttg 
tattgctgta 
taaactcatg 
ctatatttgc 
tcatagagta 
ttaactggaa 
agaacttggc 
tttccacagt 
ccctactgat 
tccttgtagg 
tttcatgctc 
attcagacat 
ctagtcaatg 
agacctttgc 
tgtgtgtttt 
aaaagataaa 
agtattacaa 
ctaaaggctt 
ccgaccatca 
gctctggcag 
aaactaccca 
ctgctatgtg 
gtctgaccac 
ggcagcggta 



caacgtaagc 
taaggtttca 
cgccgctgga 
atcaatgtcg 
tccaaattgc 
aatggtgact 
aaggtcgttg 
caaatcaata 
ggccagcaac 
cgatacttcg 
ctgctgcgta 
ttgctgcttg 
gaaaaccgcc 
tgagcgcata 
tcgtgccttc 
cgaggcattt 
ggcattggcg 
tcaggagatc 
agtggttcgc 
tggaacgggc 
tcacggcacg 
cgagagcttg 
atgaccttta 
tttaaaaatt 
gctgttctgg 
tgaaagcgct 
tcccgtgttg 
tgtgactgtt 
gttttactgg 
atctgttcat 
atctttttta 
tgaacagttg 
ataagaacct 
ttgcgtgagc 
ttttgcctca 
tagtccgtta 
ttttatctgg 
gaaaatcaac 
agtgtttaaa 
gtagttattt 
cttgtgagtt 
tttgttttca 
aagataaggc 
atagtttgtc 
tctcgtcatc 
gttcatcatc 
gttttcaatc 
cgttaagtca 
acatctcaat 
ataattacta 
tggaaaactt 
ttttgtttat 
aagaatagat 
aaggatgtcg 
aagtagcacc 
ggcacctgag 
tgaatggggg 
taatacaaga 
gtgctatctg 
ttcggattat 
tcatcaacag 



actacatttc 
tttagcgcct 
cctaccaagg 
atcgtggctg 
agttcgcgct 
tctacagcgc 
atcaaagctc 
tcactgtgtg 
gtcggttcga 
gcgatcaccg 
acatcgttgc 
gatgcccgag 
actgcgccgt 
cgctacttgc 
atccgtttcc 
ctgtcctggc 
gccttgctgt 
ggaagacctc 
atcctcggtt 
atgcggatca 
atcatcgtgc 
gcacccagcc 
atagattata 
ttttcacaaa 
tgttgctagt 
atttcttcca 
tcggcagctt 
gagctgtaac 
ttt cacctgt 
ggtgaacagc 
caccgtt ttc 
ttctactttt 
cagatccttc 
catgagaacg 
aaactggtga 
cgtaggtagg 
ttgttctcaa 
gtatcagtcg 
tctttactta 
tcaagcatta 
ttcttttgtg 
aaagacttaa 
aatatctctt 
cactggaaaa 
agctctctgg 
tgagcgtatt 
gtggggttga 
tagcgactaa 
tggtctaggt 
gtccttttcc 
gtaaattctg 
attcaagtgg 
cccagccctg 
caaacgctgt 
ctcgcaagct 
tcgctgtctt 
taaatggcac 
aaagcccgtc 
actttttgct 
cccgtgacag 
gcttacccgt 



gctcatcgcc 
caaatagatc 
caacgctatg 
gctcgaagat 
tagctggata 
ggagaatctc 
gccgcgttgt 
gcttcaggcc 
gatggcgctc 
cttccctcat 
tgctccataa 
gcatagactg 
taccaccgct 
attacagctt 
acggtgtgcg 
tggcgaacga 
tcttctacgg 
ggccgtcgcg 
t tctggaagg 
gtgagggttt 
gggagggcaa 
tgcgcgagca 
ttactaatta 
acggtttaca 
ttgttatcag 
gaattgccat 
tgattcgata 
aagttgtctc 
tctattaggt 
tttaaatgca 
atctgtgcat 
gt ttgt tagt 
cgtatttagc 
aaccattgag 
gctgaatttt 
aatctgatgt 
gttcggttac 
ggcggcctcg 
ttggtttcaa 
acatgaactt 
ttagttcttt 
catgttccag 
cactaaaaac 
tctcaaagcc 
t tgctttagc 
ggt tataagt 
gtagtgccac 
tcgctagttc 
gattttaatc 
tttgagttgt 
ctagaccctc 
ttataattta 
tgtataactc 
ttgctcctct 
cgggcaaatc 
tttcgtgaca 
tacaggcgcc 
acgggcttct 
gttcagcagt 
gtcattcaga 
cttactgtca 



agcccagtcg 
ctgttcagga 
ttctcttgct 
acctgcaaga 
acgccacgga 
gctctctcca 
ttcatcaagc 
gccatccact 
gatgacgcca 
gatgtttaac 
catcaaacat 
taccccaaaa 
gcgttcggtc 

acgaaccgaa 
tcacccggca 
gcgcaaggtt 
caaggtgctg 
gcgcttgccg 
cgagcatcgt 
gcaactgcgg 
gggctccaag 
ggggaattga 
attggggacc 
agcataacgg 
aatcgcagat 
gattttttcc 
agcagcatcg 
aggtgttcaa 
gttacatgct 
ccaaaaactc 
acggacagtt 
cttgatgct c 
cagtatgttc 
atcatgctta 
tgcagttaaa 
aatggttgtt 
gagatccatt 
cttatcaacc 
aacccattgg 
aaattcatca 
taataaccac 
attatatttt 
taattctaat 
tttaaccaaa 
taatacacca 
gaacgatacc 
acagcataaa 
atttgctttg 
actataccaa 
gggtatctgt 
tgtaaattcc 
tagaataaag 
actactttag 
acaaaacaga 
gctgaatatt 
ttcagttcgc 
ttttatggat 
cagggcgttt 
tcctgccctc 
ctggctaatg 



5700 
5760 
5820 
5880 
5940 
6000 
6060 
6120 
6180 
6240 
6300 
6360 
6420 
6480 
6540 
6600 
6660 
6720 
6780 
6840 
6900 
6960 
7020 
7080 
7140 
7200 
7260 
7 32 0 
7380 
7440 
7500 
7560 
7620 
7680 
7740 
7800 
7860 
7920 
7980 
8040 
8100 
8160 
8220 
8280 
8340 
8400 
8460 
8520 
8580 
8640 
8700 
8760 
8820 
8880 
8940 
9000 
9060 
9120 
9180 
9240 
9292 



OGZ-001 



-58- 



<210> 73 
<211> 3964 
<212> DNA 

<213> Artificial Sequence 



<220> 

<223> Description of Artificial Sequence: plasmid, pOTP71 



<400> 73 

ccatcgaatg 

tattttacca 

agaaaaggag 

ggtgtatatc 

acagaagatg 

gaacagatag 

agaatgtgca 

ggtttaaata 

gtcgctgcga 

acgtactgct 

acaatttcga 

cggcccgaca 

ggctatgtcg 

cccaaggtca 

gacatcgttg 

gtaggaagtg 

tggagccacc 

attgtgcgac 

cctgtagcgg 

ttgccagcgc 

ccggctttcc 

tacggcacct 

cctgatagac 

tgttccaaac 

ttttgccgat 

attttaacaa 

gaacccctat 

aaccctgata 

gtgtcgccct 

cgctggtgaa 
tggatctcaa 
tgagcacttt 
agcaactcgg 
cagaaaagca 
tgagtgataa 
ccgctttttt 
tgaatgaagc 
cgt tgcgcaa 
act ggatgga 
ggtttattgc 
tggggccaga 
ctatggatga 
aggaattaat 
ttaatgaggt 
agcagcctac 
ttgagatgtt 
attttttacg 
caaaagtaca 
tagccttttt 
tggggcattt 
aaagggaaac 



gccagatgat 
ctccctatca 
gaatttaaat 
atgatggaaa 
agtttgggat 

atggcattat 
caaaatactt 
taaaatatga 
tacacttgta 
atattgatga 
cagaggcgct 
atattatcgg 
gccaagtgga 
ttgcgacagg 
atccattctt 
tataaggatc 
cgcagttcga 
attttttttg 
cgcattaagc 
cctagcgccc 
ccgtcaagct 
cgaccccaaa 
ggtttttcgc 
tggaacaaca 
ttcggcctat 
aatattaacg 
ttgtttattt 
aatgcttcaa 
tattcccttt 
agtaaaagat 
cagcggtaag 
taaagttctg 
tcgccgcata 
tcttacggat 
cactgcggcc 
gcacaacatg 
cataccaaac 
actattaact 
ggcggataaa 
tgataaatct 
tggtaagccc 
acgaaataga 
gatgtctcgt 
cggaatcgaa 
attgtattgg 
agataggcac 
taataacgct 
tttaggtaca 
atgccaacaa 
tactttaggt 
acctactact 



taattcctaa 
gtgatagaga 
gttactggtt 
attagaatat 
gattttgcgc 
tatttcgtca 
tcatatcgag 
caatccgaaa 
cggcaatcca 
aaacaaacaa 
ttactcgcgt 
aaaaaacact 
aggaatcgtt 
aggcctggcg 
aaccctaaaa 
cctcgaggtc 
aaaataataa 
tctgccgttt 
gcggcgggtg 
gctcctttcg 
ctaaatcggg 
aaacttgatt 
cctttgacgt 
ctcaacccta 
tggttaaaaa 
cttacaattt 
ttctaaatac 
taatattgaa 
tttgcggcat 
gctgaagatc 
atccttgaga 
ctatgtggcg 
cactattctc 
ggcatgacag 
aacttacttc 
ggggatcatg 
gacgagcgtg 
ggcgaactac 
gttgcaggac 
ggagccggtg 
tcccgtatcg 
cagatcgctg 
ttagataaaa 
ggtttaacaa 
catgtaaaaa 
catactcact 
aaaagtttta 
cggcctacag 
ggtttttcac 
tgcgtattgg 
gatagtatgc 



tttttgttga 
aaagtgaaat 
atcgatgtgg 
cactggcgta 
tccttatttg 
gtagtgccgc 
cctcaaattg 
gaagtagggg 
ttaattgttg 
tacatgggcg 
gcagcaaagc 
gttagcgcga 
aagcgaatga 
ccgctcattg 
gggctggaat 
gacctgcagg 
gcttgacctg 
accgctactg 
tggtggttac 
ctttcttccc 
ggctcccttt 
agggtgatgg 
tggagtccac 
tctcggtcta 
atgagctgat 
caggtggcac 
attcaaatat 
aaaggaagag 
tttgccttcc 

agttgggtgc 
gttttcgccc 
cggtattatc 
agaatgactt 
taagagaatt 
tgacaacgat 
taactcgcct 
acaccacgat 
ttactctagc 
cacttctgcg 
agcgtggctc 
tagttatcta 
agataggtgc 
gtaaagtgat 
cccgtaaact 
ataagcgggc 
tttgcccttt 
gatgtgcttt 
aaaaacagta 
tagagaatgc 
aagatcaaga 
cgccattatt 



cactctatca 
gaatagttcg 
ggaacaccaa 
tagaaacaag 
atcactccgg 
caatcatgtt 
ttggtccagg 
cagacagaat 
tcgatttcgg 
gggcgattgc 
ttcctcgtat 
tgcaatctgg 
aatggcaggc 
cgaacgaatc 
tgatttatga 
gggaccatgg 
tgaagtgaaa 
cgtcacggat 
gcgcagcgtg 
ttcctttctc 
agggttccga 
ttcacgtagt 
gttctttaat 
ttcttttgat 
ttaacaaaaa 
ttttcgggga 
gtatccgctc 
tatgagtatt 
tgtttttgct 

acgagtgggt 
cgaagaacgt 
ccgtattgac 
ggttgagtac 
atgcagtgct 
cggaggaccg 
tgatcgttgg 
gcctgtagca 
ttcccggcaa 
ctcggccctt 
tcgcggtatc 
cacgacgggg 
ctcactgatt 
taacagcgca 
cgcccagaag 
tttgctcgac 
agaaggggaa 
actaagtcat 
tgaaactctc 
attatatgca 
gcatcaagtc 
acgacaagct 



ttgatagagt 
acaaaaatct 
tactgtactt 
caggcataaa 
gcttatgttt 
tgcgttagaa 
tatgaaaacc 
cgtaaatgct 
aaccgccaca 
ccctgggatt 
cgaaatcacc 
aattttattt 
aaaacaggac 
agattgtata 
aagaaaccgc 
tctcagcgct 
aatggcgcac 
ctccacgcgc 
accgctacac 
gccacgtt eg 
t ttagtgctt 
gggccatcgc 
agtggactct 
ttataaggga 
tttaacgega 
aatgtgcgcg 
atgagacaat 
caacatttcc 
cacccagaaa 
tacatcgaac 
tttccaatga 
geegggcaag 
tcaccagtca 
gccataacca 
aaggagctaa 
gaaceggage 
atggcaacaa 
caattgatag 
ccggctggct 
attgeagcac 
agtcaggcaa 
aagcattggt 
ttagagctgc 
ctaggtgtag 
gccttagcca 
agctggcaag 
cgcgatggag 
gaaaatcaat 
ctcagcgcag 
gctaaagaag 
atcgaattat 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1 620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 

2160 

2220 

2280 

2340 

2400 

2460 

2520 

2580 

2640 

2700 

2760 

2820 

2880 

2940 

3000 

3060 



OGZ-001 
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ttgatcacca 
tagaaaaaca 
ggtaacttca 
aaatccctta 
gatcttcttg 
cgctaccagc 
ctggcttcag 
accacttcaa 
tggctgctgc 
cggataaggc 
gaacgaccta 
ccgaagggag 
cgagggagct 
tctgacttga 
ccagcaacgc 
gaca 



aggtgcagag 
acttaaatgt 
ctagtttaaa 
acgtgagttt 
agatcctttt 
ggtggtttgt 
cagagcgcag 
gaactctgta 
cagtggcgat 
gcagcggtcg 
caccgaactg 
aaaggcggac 
tccaggggga 
gcgtcgattt 
ggccttttta 



ccagccttct 
gaaagtgggt 
aggatctagg 
tcgttccact 
tttctgcgcg 
ttgccggatc 
ataccaaata 
gcaccgccta 
aagtcgtgtc 
ggctgaacgg 
agatacctac 
aggtatccgg 
aacgcctggt 
ttgtgatgct 
cggttcctgg 



tattcggcct 
cttaaaagca 
tgaagatcct 
gagcgtcaga 
taatctgctg 
aagagctacc 
ctgtccttct 
catacctcgc 
ttaccgggtt 
ggggttcgtg 
agcgtgagct 
taagcggcag 
atctttatag 
cgtcaggggg 
ccttttgctg 



tgaattgatc 
gcataacctt 
ttttgataat 
ccccgtagaa 
cttgcaaaca 
aactcttttt 
agtgtagccg 
tctgctaatc 
ggactcaaga 
cacacagccc 
atgagaaagc 
ggtcggaaca 
tcctgtcggg 
gcggagccta 
gccttttgct 



atatgcggat 
tttccgtgat 
ctcatgacca 
aagatcaaag 
aaaaaaccac 
ccgaaggtaa 
tagttaggcc 
ctgttaccag 
cgatagttac 
agcttggagc 
gccacgcttc 
ggagagcgca 
tttcgccacc 
tggaaaaacg 
cacatgaccc 



3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3900 
3960 
3964 



<210> 74 
<211> 3859 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> Description of Artificial Sequence: plasmid, pOTP72 



<400> 74 

ccatcgaatg 

tattttacca 

agaaaaggag 

ttgtgcgata 

gctaaagaag 

gaagaaaatg 

tttcatttag 

gtaaataatg 

ggcaagcatt 

aaaagcgcta 

aaaagcacta 

cat ttagcca 

ttt ttacccc 

aaaaaagcag 

catggtctca 

tgaaaaatgg 

cggatctcca 

gcgtgaccgc 

ttctcgccac 

tccgatttag 

gtagtgggcc 

ttaatagtgg 

ttgatttata 

aaaaatttaa 

ggggaaatgt 

cgctcatgag 

gtattcaaca 

ttgctcaccc 

tgggttacat 

aacgttttcc 

ttgacgccgg 

agtactcacc 

gtgctgccat 

gaccgaagga 

gttgggaacc 



gccagatgat 
ctccctatca 
gaatttaaat 
taggcaacac 
atttaaagcg 
aaaaagccct 
aaaccgacta 
gcgtggtggt 
taggagggtg 
aaattttaga 
gagacgctgt 
aaaatcaaaa 
attctgtttg 
ggatactaga 
gcgcttggag 
cgcacattgt 
cgcgccctgt 
tacacttgcc 
gttcgccggc 
tgctttacgg 
atcgccctga 
actcttgttc 
agggattttg 
cgcgaatttt 
gcgcggaacc 
acaataaccc 
tttccgtgtc 
agaaacgctg 
cgaactggat 
aatgatgagc 
gcaagagcaa 
agtcacagaa 
aaccatgagt 
gctaaccgct 
ggagctgaat 



taattcctaa 
gtgatagaga 
gccagctagg 
gcgtatccat 
tttgggtatt 
tttgaattgt 
tgtagggctt 
ggatgccggg 
tattttaccc 
gcaacctttc 
gaattacggc 
a atctatctt 
caaggagcgt 
atgcaaataa 
ccacccgcag 
gcgacatttt 
agcggcgcat 
agcgccctag 
tttccccgtc 
cacctcgacc 
tagacggttt 
caaactggaa 
ccgatttcgg 
aacaaaatat 
cctatttgtt 
tgataaatgc 
gcccttattc 
gtgaaagtaa 
ctcaacagcg 
acttttaaag 
ctcggtcgcc 
aagcatctta 
gataacactg 
tttttgcaca 
gaagccatac 



tttttgttga 
aaagtgaaat 
caatctttta 
tttgcacaaa 
caaaaggaaa 
taccctaacg 
gggatagacc 
agtgcgatta 
ggtttagccc 
aaggccttag 
atggttttga 
tgtgggggcg 
ttggtttttg 
ggatccctcg 
ttcgaaaaat 
ttttgtctgc 
taagcgcggc 
cgcccgctcc 
aagctctaaa 
ccaaaaaact 
ttcgcccttt 
caacactcaa 
cctattggtt 
taacgcttac 
tatttttcta 
ttcaataata 
ccttttttgc 
aagatgctga 
gtaagatcct 
ttctgctatg 
gcatacacta 
cggatggcat 
cggccaactt 
acatggggga 
caaacgacga 



cactctatca 
gaatagttcg 
cagatttgaa 
actatcagct 
ttttttacat 
ctaaaaatat 
ggcaaatggc 
cgatagattt 
aatatattca 
attctttaga 
gcgtcattgc 
atgcgaagta 
acgggatgga 
aggtcgacct 
aataagcttg 
cgtttaccgc 
gggtgtggtg 
tttcgctttc 
tcgggggctc 
tgattagggt 
gacgttggag 
ccctatctcg 
aaaaaatgag 
aatttcaggt 
aatacattca 
ttgaaaaagg 
ggcattttgc 
agatcagttg 
tgagagtttt 
tggcgcggta 
ttctcagaat 
gacagtaaga 
acttctgaca 
tcatgtaact 
gcgtgacacc 



ttgatagagt 
acaaaaatct 
aaacctggtt 
cttttcaagc 
tagcgtgaat 
tgcagggttt 
gtgtctggcg 
aatcaaagag 
tgcgtataaa 
agttttacct 
rtgtatccag 
tttgagcgcg 
aatcgctctt 
gcagggggac 
acctgtgaag 
tactgcgtca 
gttacgcgca 
ttcccttcct 
cctttagggt 
gatggttcac 
tccacgttct 
gtctattctt 
ctgatttaac 
ggcacttttc 
aatatgtatc 
aagagtatga 
cttcctgttt 
ggtgcacgag 
cgccccgaag 
ttatcccgta 
gacttggttg 
gaattatgca 
acgatcggag 
cgccttgatc 
acgatgcctg 



60 

120 

18C 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 

1320 

1380 

1440 

1500 

1560 

1620 

1680 

1740 

1800 

1860 

1920 

1980 

2040 

2100 



OGZ-001 
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tagcaatggc 
ggcaacaatt 
cccttccggc 
gtatcattgc 
cggggagtca 
tgattaagca 
gcgcattaga 
agaagctagg 
tcgacgcctt 
gggaaagctg 
gtcatcgcga 
ctctcgaaaa 
atgcactcag 
aagtcgctaa 
aagctatcga 
tgatcatatg 
acctttttcc 
ataatctcat 
tagaaaaga t 
aaacaaaaaa 
tttttccgaa 
agccgtagtt 
taatcctgtt 
caagacgata 
agcccagctt 
aaagcgccac 
gaacaggaga 
tcgggtttcg 
gcctatggaa 
ttgctcacat 



aacaacgttg 
gatagactgg 
tggctggttt 
agcactgggg 
ggcaactatg 
ttggtaggaa 
gctgcttaat 
tgtagagcag 
agccattgag 
gcaagatttt 
tggagcaaaa 
tcaattagcc 
cgcagtgggg 
agaagaaagg 
attatttgat 
cggattagaa 
gtgatggtaa 
gaccaaaatc 
caaaggatct 
accaccgcta 
ggtaactggc 
aggccaccac 
accagtggct 
gttaccggat 
ggagcgaacg 
gcttcccgaa 
gcgcacgagg 
ccacctctga 
aaacgccagc 
gacccgaca 



cgcaaactat 
atggaggcgg 
attgctgata 
ccagatggta 
gatgaacgaa 
ttaatgatgt 
gaggtcggaa 
cctacattgt 
atgttagata 
ttacgtaata 
gtacatttag 
tttttatgcc 
cattttactt 
gaaacaccta 
caccaaggtg 
aaacaactta 
cttcactagt 
ccttaacgtg 
tcttgagatc 
ccagcggtgg 
ttcagcagag 
ttcaagaact 
gctgccagtg 
aaggcgcagc 
acctacaccg 
gggagaaagg 
gagcttccag 
cttgagcgtc 
a acgcggcct 



taactggcga 
ataaagttgc 
aatctggagc 
agccctcccg 
atagacagat 
ctcgtttaga 
tcgaaggttt 
attggcatgt 
ggcaccatac 
acgctaaaag 
gtacacggcc 
aacaaggttt 
taggttgcgt 
ctactgatag 
cagagccagc 
aatgtgaaag 
ttaaaaggat 
agttttcgtt 
ctttttttct 
tttgtttgcc 
cgcagatacc 
ctgtagcacc 
gcgataagtc 
ggtcgggctg 
aactgagata 
cggacaggta 
ggggaaacgc 
gatttttgtg 
ttttacggtt 



actacttact 
aggaccactt 
cggtgagcgt 
tatcgtagtt 
cgctgagata 
taaaagtaaa 
aacaacccgt 
aaaaaataag 
tcacttttgc 
ttttagatgt 
tacagaaaaa 
ttcactagag 
attggaagat 
tatgccgcca 
cttcttattc 
tgggtcttaa 
ctaggtgaag 
ccactgagcg 
gcgcgtaatc 
ggatcaagag 
aaatactgtc 
gcctacatac 
gtgtcttacc 
aacggggggt 
cctacagcgt 
tccggtaagc 
ctggtatctt 
atgctcgtca 
cctggccttt 



ctagcttccc 
ctgcgctcgg 
ggctctcgcg 
atctacacga 
ggtgcctcac 
gtgattaaca 
aaactcgccc 
cgggctttgc 
cctttagaag 
gctttactaa 
cagtatgaaa 
aatgcattat 
caagagcatc 
ttattacgac 
ggccttgaat 
aagcagcata 
atcctttttg 
tcagaccccg 
tgctgcttgc 
ctaccaactc 
cttctagtgt 
ctcgctctgc 
gggttggact 
tcgtgcacac 
gagctatgag 
ggcagggtcg 
tatagtcctg 
ggggggcgga 
tgctggcctt 



2160 
2220 
2280 
2340 
2400 
2460 
2520 
2580 
2640 
2700 
2760 
2820 
2880 
2940 
3000 
3060 
3120 
3180 
3240 
3300 
3360 
3420 
3480 
3540 
3600 
3660 
3720 
3780 
3840 
3859 



<210> 75 
<211> 3934 
<212> DNA 

<213> Artificial Seguence 
<220> 

<223> Description of Artificial Seguence: plasmid, pOTP73 



<400> 75 

ccatcgaatg 

tattttacca 

agaaaaggag 

cgggtcatcg 

ctggtcgaac 

gttcgcagcg 

tcggcgctgg 

taccagcgcc 

aaggcctgcc 

ggagtgcatc 

acccataccc 

cagccagggc 

ttcgttcgtg 

ctgacgggtg 

gacctggttt 

gacctgcagg 

gcttgacctg 

accgctactg 

tggtggttac 

ctttcttccc 

ggctcccttt 



gccagatgat 
ctccctatca 
gaatttaaat 
agggggcggc 
agttaacgtc 
agcaggaaac 
ttgcatcatc 
tggggctcga 
tggtcattga 
tggggggcta 
gacgtatccg 
aggccacggc 
agcagtacgc 
gggatgccga 
tcgtagggct 
gggaccatgg 
tgaagtgaaa 
cgtcacggat 
gcgcagcgtg 
ttcctttctc 
agggttccga 



taattcctaa 
gtgatagaga 
gattcttgag 
gcggtcggtc 
gcagcaagcg 
ctcgcaactg 
cggcaagcag 
ccgctggctg 
tctggggacc 
catatgcccg 
ctacgacgat 
cgaggcggtt 
catggcgtgc 
actggttcgc 
ggcactggct 
tctcagcgct 
aatggcgcac 
ctccacgcgc 
accgctacac 
gccacgttcg 
tttagtgctt 



tttttgttga 
aaagtgaaat 
ctcgactgtg 
gccggtggcc 
ctgccagtgc 
gtcgcacggt 
ttggcgggtg 
gccctcgtcg 
gcggtcacct 
ggcatgaccc 
gcagaggccc 
gagcggggtt 
gagctgctcg 
gacgaactgg 
tgcccgattg 
tggagccacc 
attgtgcgac 
cctgtagcgg 
ttgccagcgc 
ccggctttcc 
tacggcacct 



cactctatca 
gaatagttcg 
gaaactcgct 
ttgcggagtc 
gagcctgtcg 
tggagcagct 
tgcgcaacgg 
cggctcatca 
ctgacctggt 
tgatgagaag 
ggcgggcgct 
gtctgctcat 
gtccggattg 
ctggcgcccg 
agtaaggatc 
cgcagttcga 
attttttttg 
cgcattaagc 
cctagcgccc 
ccgtcaagct 
cgaccccaaa 



ttgatagagt 
acaaaaatct 
gatcaagtgg 
cgatgatgcc 
cctggtgagc 
gttcccggtt 
ctatctcgat 
cctggctaag 
cgcggcggat 
ccagttgcgc 
tgccagtctc 
gctcaggggg 
tgaaatattc 
gatcatgccg 
cctcgaggtc 
aaaataataa 
tctgccgttt 
gcggcgggtg 
gctcctttcg 
ctaaatcggg 
aaacttgatt 



60 

120 

180 

240 

300 

360 

420 

480 

540 

600 

660 

720 

780 

840 

900 

960 

1020 

1080 

1140 

1200 

1260 
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agggtgatgg ttcacgtagt gggccatcgc 
tggagtccac gttctttaat agtggactct 
tctcggtcta ttcttttgat ttataaggga 
atgagctgat ttaacaaaaa tttaacgcga 
caggtggcac ttttcgggga aatgtgcgcg 
attcaaatat gtatccgctc atgagacaat 
aaaggaagag tatgagtatt caacatttcc 
tttgccttcc tgtttttgct cacccagaaa 
agttgggtgc acgagtgggt tacatcgaac 
gttttcgccc cgaagaacgt tttccaatga 
cggtattatc ccgtattgac gccgggcaag 
agaatgactt ggttgagtac tcaccagtca 
taagagaatt atgcagtgct gccataacca 
tgacaacgat cggaggaccg aaggagctaa 
taactcgcct tgatcgttgg gaaccggagc 
acaccacgat gcctgtagca atggcaacaa 
ttactctagc ttcccggcaa caattgatag 
cacttctgcg ctcggccctt ccggctggct 
agcgtggctc tcgcggtatc attgcagcac 
tagttatcta cacgacgggg agtcaggcaa 
agataggtgc ctcactgatt aagcattggt 
gtaaagtgat taacagcgca ttagagctgc 
cccgtaaact cgcccagaag ctaggtgtag 
ataagcgggc tttgctcgac gccttagcca 
tttgcccttt agaaggggaa agctggcaag 
gatgtgcttt actaagtcat cgcgatggag 
aaaaacagta tgaaactctc gaaaatcaat 
tagagaatgc attatatgca ctcagcgcag 
aagatcaaga gcatcaagtc gctaaagaag 
cgecattatt acgacaagct atcgaattat 
tattcggcct tgaattgatc atatgcggar 
cttaaaagca gcataacctt tttccgtgat 
tgaagatcct ttttgataat ctcatgacca 
gagcgtcaga ccccgtagaa aagatcaaag 
taatctgctg cttgcaaaca aaaaaaccac 
aagagctacc aactcttttt ccgaaggtaa 
ctgtccttct agtgtagccg tagttaggcc 
catacctcgc bctgctaatc ctgttaccag 
ttaccgggtt ggactcaaga cgatagttac 
ggggttcgtg cacacagccc agcttggagc 
agcgtgagct atgagaaagc gccacgcttc 
taagcggcag ggtcggaaca ggagagcgca 
atctttatag tcctgtcggg tttcgccacc 
cgtcaggggg gcggagccta tggaaaaacg 
ccttttgctg gccttttgct cacatgaccc 



cctgatagac ggtttttcgc cctttgacgt 1320 

tgttccaaac tggaacaaca ctcaacccta 1380 

ttttgccgat ttcggcctat tggttaaaaa 1440 

attttaacaa aatattaacg cttacaattt 1500 

gaacccctat ttgtttattt ttctaaatac 1560 

aaccctgata aatgcttcaa taatattgaa 1620 

gtgtcgccct tattcccttt tttgcggcat 1680 

cgctggtgaa agtaaaagat gctgaagatc 1740 

tggatctcaa cagcggtaag atccttgaga 1800 

tgagcacttt taaagttctg ctatgtggcg 1860 

agcaactcgg tcgccgcata cactattctc 1920 

cagaaaagca tcttacggat ggcatgacag 1980 

tgagtgataa cactgcggcc aacttacttc 2040 

ccgctttttt gcacaacatg ggggatcatg 2100 

tgaatgaagc cataccaaac gacgagcgtg 2160 

cgttgcgcaa actattaact ggcgaactac 2220 

actggatgga ggcggataaa gttgcaggac 2280 

ggtttattgc tgataaatct ggagccggtg 2340 

tggggccaga tggtaagccc tcccgtatcg 2400 

ctatggatga acgaaataga cagatcgctg 2460 

aggaattaat gatgtctcgt ttagataaaa 2520 

ttaatgaggt cggaatcgaa ggtttaacaa 2580 

agcagcctac attgtattgg catgtaaaaa 2640 

ttgagatgtt agataggcac catactcact 2700 

attttttacg taataacgct aaaagtttta 2760 

caaaagtaca tttaggtaca cggcctacag 2820 

tagccttttt atgccaacaa ggtttttcac 2880 

tggggcattt tactttaggt tgcgtattgg 2940 

aaagggaaac acctactact gatagtatgc 3000 

ttgatcacca aggtgcagag ccagccttct 3060 

tagaaaaaca acttaaatgt gaaagtgggr 3120 

ggtaacttca ctagtttaaa aggatctagg 3180 

aaatccctta acgtgagttt tcgttccact 3240 

gatcttcttg agatcctttt tttctgcgcg 3300 

cgctaccagc ggtggtttgt ttgccggatc 3360 

ctggcttcag cagagcgcag ataccaaata 3420 

accacttcaa gaactctgta gcaccgccta 3480 

tggctgctgc cagtggcgat aagtcgtgtc 3540 

cggataaggc gcagcggtcg ggctgaacgg 3600 

gaacgaccta caccgaactg agatacctac 3660 

ccgaagggag aaaggcggac aggtatccgg 3720 

cgagggagct tccaggggga aacgcctggt 3780 

tctgacttga gcgtcgattt ttgtgatgct 3840 

ccagcaacgc ggccttttta cggttcctgg 3900 

gaca 3934 



SEQ ID NO: 69 Pseudomonas syringae pv tomato coaX coding sequence 

atgatt cttgagctcgattgcggcaacagctttatcaagtggcggataatcacaaagagttgctcaacgttggtcag 
cggcggagtagtggact cggacacagccttgctagagtgcctgggcaatctgtcaggcgcagcattcagcgattgcc 
gtctggtaagcgttcgtagcgcggaagaaacggcgaagctggtttgcgcgctggcagataccttttccattagccct 
gtctgtgcagcgccggcgccagagcttgccggggtaatcaatggatacgacgattttgcacgcttggggctggatcg 
ctggttggcatttgtaggggcttaccaccttgttaagggtgcctgcctggtgatcgatctgggcaccgccattacgt 
ctgactttgttgaagcgtcaggaaagcatctgggtggtttcat ctgtcctggcatgccactgatgcgcaatcagctg 
cgtacccacacccgtcgcattcgatatgacgatgcagaggctgaaaaagccctggtacgactcgtgcctggccgtgc 
gacggccgaggctgtggagcgaggttgttctctcatgcttcgcggattcgcaatgactcagatcgagatagctcgcg 
aatactggggggacgactttgctattttcgtgacaggaggcgacgctgtcttggttgctgatgtgttaccgggcgct 
cgcattgtccctgatttggtattcgttggcctggctctcgcttgccctttacgttga 

SEQ ID NO:70 Pseudomonas syringae pv tomato coaX protein sequence 
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MILELDCGNSFIKWRIITKSCSTLVSGGVVDSDTALLECLGNLSGAAFSDCRLVSVRSAEETAKLVCALADTFSISP 
VCAAPAPELAGVINGYDDFARLGLDRWLAFVGAYHLVKGACLVIDLGTAITSDFVEASGKHLGGFICPGMPLMRNQL 
RTHTRRIRYDDAEAEKALVRLVPGRATAEAVERGCSLMLRGFAMTQIEIAREYWGDDFAIFVTGGDAVLVADVLPGA 
RIVPDLVFVGLALACPLR 

SEQ ID NO: 71 - pAN2 9 6 

TGCGCCGCTACAGGGCGCGTCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCT 
ATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTT 
GTAAAACGACGGCCAGTGAATTGTAATACGACTCACTATAGGGCGAATTGGGCCCGACGTCgcatgcTGGATGAAAAGCC 
GATGACCGCTTTTCAGGTCTGTCAGCAGCTTTTTCCTGCTGTATATGAAAAGGAATTGTTTTTAACGATGTCAGAAACGG 
CAGGTCACCTTGATGTGTTGGAGGCTGAAGAAGCCATCACGTCATATTGGGAAGGAAATACCGTATACTTTAAAACAATG 
AAGAGGTGAAATGGGTGAAACATATAGCGGGAAAAAGGATTTGGATAACCGGCGCTTCAGGAGGGCTTGGAGAAAGAATC 
GCATACTTATGCGCGGCTGAAGGAGCCCATGTCCTGCTGTCGGCTAGACGCGAGGATCGTTTGATAGAAATCAAAAGGAA 
AATAACCGAGGAATGGAGCGGACAGTGTGAGATTTTTCCTCTGGATGTCGGCCGCCTAGAGGATATCGCCCGGGTCCGCG 
ATCAGATCGGCTCGATTGATGTACTGATTAACAATGCAGGCTTCGGTATATTTGAAACGGTTTTAGACTCTACATTGGAT 
GACATGAAAGCGATGTTTGATGTGAATGTCTTCGGCCTGATCGCCTGTACAAAAGCGGTGCTTCCGCAAATGCTTGAGCA 
AAAAAAGGGACATATCATCAATATCGCCTCTCAAGCGGGGAAAATCGCCACACCGAAGTCTAGCCTGTATTCCGCGACCA 
AACATGCCGTGTTAGGTTACTCAAACGCTTTGCGGATGGAGCTTTCGGGAACCGGCATTTATGTGACAACAGTCAACCCG 
GGCCCGATTCAGACGGACTTTTTTTCCATTGCTGATAAAGGCGGGGACTACGCCAAAAATGTCGGCCGCTGGATGCTTGA 
TCCTGATGACGTGGCAGCTCAAATTACAGCTGCAATTTTTACGAAAAAGCGGGAGATCAATCTTCCGCGTTTAATGAATG 
CCGGCACTAAGCTGTATCAGCTGTTTCCAGCTCTTGTAGAAAAGCTGGCAGGACGCGCGCTCATGAAAAAATAATGATAG 
AACTGCCTGTGGTGGAGTGGCTTGTTTCTCACGGGGCAGTTTTTGATAGTGGAAGGGAGAGATTGTTGAATGTCAGTTCA 
TTCAGAAGTCCTTCATGCTCTGCTTAAAGATCCGTTTATTCAGAAACTGATTGATGCAGAGCCTGTATTCTGGGCAAATT 
CAGGCAAGAAAGAGGGGCCATTACCCCGTGCAGATGAGTGGGCAACCGAGATAGCGGAAGCGGAAAAAAGAATGCAGCGG 
TTTGCACCTTACATTGCCGAGGTGTTTCCTGAGACGAAAGGCGCTAAAGGAATCATCGAGTCTCCGCTTTTTGAGGTGCA 
GCATATGAAGGGAAAGCTGGAAGCGGCATATCAGCAGCCATTTCCCGGAAGATGGCTTTTAAAGTGCGACCATGAGCTTC 
CGATTTCAGGATCGATTAAAGCGAGGGGCGGGATTTATGAAGTGTTAAAGTATGCTGAAAATCTCGCGCTTCAAGAAGGA 
ATGCTTCAGGAAACCGATGATTACCGCATCTTACAGGAAGAGCGGTTTACCGGGTTTTTCTCCCGCTATTCGATTGCTGT 
CGGTTCGACAGGAAATCTAGGTTTAAGCATCGGCATCATCGGCGCGGCACTCGGGTTTCGCGTGACAGTGCATATGTCCG 
CCGATGCTAAGCAGTGGAAAAAGGATCTCCTCCGCCAAAAGGGAGTCACTGTTATGGAGTACGAAACAGATTACAGTGAA 
GCGGTGAACGAAGGGAGACGGCAGGCGGAACAAGATCCATTCTGTTATTTTATTGATGATGAACATTCTCGTCAGCTGTT 
CTTAGGATATGCTGTTGCTGCAAGCCGATTAAAAACACAGCTTGACTGTATGAATATAAAGCCAAGTCTTGAGACGCCCT 
TGTTTGTGTATCTGCCGTGCGGAGTCGGCGGAGGACCGGGCGGTGTAGCATTTGGGCTGAAGCTTTTATACGGAGATGAT 
GTTCATGTGTTTTTCGCAGAACCAACTCATTCACCTTGTATGCTGTTAGGGCTTTATTCAGGACTTCACGAGAAGATCTC 
CGTCCAGGATATCGGCCTGGATAATCAGACGGCTGCTGACGGACTTGCCGTAGGGAGGCCGTCAGGATTTGTCGGCAAGC 
TGATTGAACCGCTTCTGAGCGGCTGTTATACGGTAGAGGACAATACGCTTTATACTTTGCTTCATATGCTGGCTGTATCT 
GAAGATAAATATTTAGAGCCCTCTGCTCTTGCTGGCATGTTCGGGCCGGTTCAGCTTTTTTCGACAGAAGAGGGAAGGCG 
CTATGCTCAGAAATATAAGATGGAACATGCCGTACATGTCGTCTGGGGAACGGGAGGAAGCATGGTTCCAAAAGATGAAA 

TGGCTGCGTATAACCGAATCGGTGCTGATTTGCTAAAAAAACGAAATGGAAAATAAGCAGACAGTGAAAAGGTTTTCCGT 
TACAATCTTTGTAAGGGTTTTAACCTACAGAGAGTCAGGTGTAAACAGTGAAAAATAAAGAACTTAACCTACATACTTTA 
TATACACAGCACAATCGGGAGTCTTctgcagctcgagcAATAGTTACCCTTATTATCAAGATAAGAAAGAAAAGGATTTT 
TCGCTACGCTCAAATCCTTTAAAAAAACACAAAAGACCACATTTTTTAATGTGGTCTTTATTCTTCAACTAAAGCACCCA 
TTAGTTCAACAAACGAAAATTGGATAAAGTGGGATATTTTTAAAATATATATTTATGTTACAGTAATATTGACTTTTAAA 
AAAGGATTGATTCTAATGAAGAAAGCAGACAAGTAAGCCTCCTAAATTCACTTTAGATAAAAATTTAGGAGGCATATCAA 
ATGAACTTTAATAAAATTGATTTAGACAATTGGAAGAGAAAAGAGATATTTAATCATTATTTGAACCAACAAACGACTTT 
TAGTATAACCACAGAAATTGATATTAGTGTTTTATACCGAAACATAAAACAAGAAGGATATAAATTTTACCCTGCATTTA 
TTTTCTTAGTGACAAGGGTGATAAACTCAAATACAGCTTTTAGAACTGGTTACAATAGCGACGGAGAGTTAGGTTATTGG 
GATAAGTTAGAGCCACTTTATACAATTTTTGATGGTGTATCTAAAACATTCTCTGGTATTTGGACTCCTGTAAAGAATGA 
CTTCAAAGAGTTTTATGATTTATACCTTTCTGATGTAGAGAAATATAATGGTTCGGGGAAATTGTTTCCCAAAACACCTA 
TACCTGAAAATGCTTTTTCTCTTTCTATTATTCCATGGACTTCATTTACTGGGTTTAACTTAAATATCAATAATAATAGT 
AATTACCTTCTACCCATTATTACAGCAGGAAAATTCATTAATAAAGGTAATTCAATATATTTACCGCTATCTTTACAGGT 
ACATCATTCTGTTTGTGATGGTTATCATGCAGGATTGTTTATGAACTCTATTCAGGAATTGTCAGATAGGCCTAATGACT 
GGCTTTTATAATATGAGATAATGCCGACTGTACTTTTTACAGTCGGTTTTCTAATGTCACTAACCTGCCCCGTTAGTTGA 
AGAAGGTTTTTATATTACAGCTgtcgacTAAGGTCGAGGAAGTGTTGGTAAGGAGGGTATGAAATGTGCATCATATTGAA 
CTGTATGTCTCTGATTTGGAGGCGTCTAGGCGGTTTTGGGGCTGGTTCTTAAAAGAACTTGGTTATAAAGAGTATCAAAA 
ATGGAGCTCAGGCATCAGCTGGAAGAAAGATCGTTTTTACCTAGTGATTGTGCAGGCGAAAGAGCCATTTCTAGAGCCGG 
AATACCATAGATGCCGAGTCGGTCTGAACCATCTCGCATTTCATGCTGAATCCAAGCTTCAAGTCGATCAGATGACTGAA 
AAATTGACGGCAAAAGGCTATCGTGTGTTGTACCGAGACAGGCATCCTTTTGCCGGAGGAGACGGGCATTATGCAGTCTT 
TTGTGAGGATCCAGACCGGATTAAGGTAGAGCTCGTTGCCCCAAGCTGTTAATCGTGATCTTCGGACAGGCTGTTCAGCT 
TTTTCTCAATGCGATCCAGCTGCGCTTTTCGGTTTTTCGCATACTTGAAGCCTGTAACAGCCGCAAAGACGACAGCGGCA 
AATATAATAAATACAAACAGCTGAAACATCACATCACCTATATTCATGTTCTTCACCTCATGTTTGCGGGAGAGATTCAT 
TCTCTTCCGTTTTTTATTTAAAGCGGCTTTTCCAGACGGGAACGGTGTTTTGTGGTCTCCATTTTCATTTGCCGATAGGC 
GAACGCTAAAAATGGCAGGCCGAGCAGGGTAATGCCGCTCAGGACAGAAAAAATATAAATCGGCCGGCCAGCGCCAAACA 
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GGTCTATACATATCCCCCCGACCCAAGGGCCGATGACGTTTCCGAGCTGTGGAAAACCGATTGCCCCGAAATAAGTGCCT 
TTTAATCCTGGTTTTGCAATCTGGTCTACATACAAATCCATCATAGAGAATAAAAGCACTTCGCCGATTGTAAATGTGAT 
GACAATCATCACAATTGATGGAACACCGTGTGATACGGTGAAAATGGCCATGCTGATGCTAACCATCACATTACCGAGCA 
TCAGAGAACAAAGCGGCGAAAACCGTTTTGCAAAATGGACAATGGGAAATTGCGTCGCCAACACAACGATTGCGTTTAAT 
GTCAGCATCAGCCCATACAGCTTCGTTCCATTGCCGATCAAGGGGTTCTGCGCCATATACTGAGGGAATGTGGAACTGAA 
TTGTGAGTAGCCGAAGGTGCATAGCGTAATGCCGACCAAAGCAATGGTAAAAAGATAATCCTTTTGCGTGACCATAAACG 
CTTCCCGCACGCTCATATTTCGGGACTGGGCTGGTGCTGATAAGGATGGATGTTTTTTAAATTGGAGGGCAAGCACAATT 
CCGTATAGTCCGTAAATGACTGCAGGCACCAAAAAGGGCGTAGTCGATTGCGATGAGCCGAAATATAGGCCAAGCACAGG 
TCCGAAGACAACGCCGATATTAATAGCCGCATAGCGTAAATTAAAAACTAGCAGTCTCGTTTTTTCTTCTGTCATATCAG 
ACAACAAGGCCTTTGAAGCGGGCTCAAACAGTGATTTGCAAAGACCGTTTAATGCGTTTACTACAAAAAACACCCAGAGA 
TTAGATGCTGCCGCAAAGCCTGCAAATACCAGCATCCATCCGAAAATCGATACAAGCATCATGTTTTTTCTGCCGAATTT 
ATCTGAGATATATCCGCCGTAAAAGCTTGCGAGGATGCCGACTGATGAGCTCGCGGCGATGACCAGCCCTGCATAGGAAG 
CTGATGCGCCTTGGACGGCTGTCAAATAAATCGCTAAAAAAGGAATGCTCATCGATGTTGCCATTCTGCCGAAAATGGTT 
CCGATTATAATTGTAacgcgtTGGATGCATAGCTTGAGTATTCTATAGTGTCACCTAAATAGCTTGGCGTAATCATGGTC 
ATAGCTGTTTCCTGTGTGAAATTGTTATCCGCTCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCT 
GGGGTGCCTAATGAGTGAGCTAACTCACATTAATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGC 
CAGCTGCATTAATGAATCGGCCAACGCGCGGGGAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGA 
CTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAGCGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAG 

GGGATAACGCAGGAAAGAACATGTGAGCAAAAGGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTT 
TTCGATAGGCTCCGCCCCCCTGACGAGCATCACAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATA 
AAGATACCAGGCGTTTCCCCCTGGAAGCTCCCTCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCG 
CCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCTCATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCC 
AAGCTGGGCTGTGTGCACGAACCCCCCGTTCAGCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCC 
GGTAAGACACGACTTATCGCCACTGGCAGCAGCCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAG 
AGTTCTTGAAGTGGTGGCCTAACTACGGCTACACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACC 
TTCGGAAAAAGAGTTGGTAGCTCTTGATCCGGCAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGCAAGCAGCA 
GATTACGCGCAGAAAAAAAGGATCTCAAGAAGATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACT 
CACGTTAAGGGATTTTGGTCATGAGATTATCAAAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAA 
TCAATCTAAAGTATATATGAGTAAACTTGGTCTGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTG 
TCTATTTCGTTCATCCATAGTTGCCTGACTCCCCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCA 
GTGCTGCAATGATACCGCGAGACCCACGCTCACCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAG 
CGCAGAAGTGGTCCTGCAACTTTATCCGCCTCCATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTA^GTAGTTCGCC 
AGTTAATAGTTTGCGCAACGTTGTTGGCATTGCTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCA 
GCTCCGGTTCCCAACGATCAAGGCGAGTTACATGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCG 
ATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTTATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCC 
ATCCGTAAGATGCTTTTCTGTGACTGGTGAGTACTCAACCAAGTCATTCTGAGAATACCGCGCCCGGCGACCGAGTTGCT 
CTTGCCCGGCGTCAATACGGGATAATAGTGTATGACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCG 
GGGCGAAAACTCTCAAGGATCTTACCGCTGTTGAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGC 
ATCTTTTACTTTCACCAGCGTTTCTGGGTGAGCAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACAC 

GGAAATGTTGAATACTCATACTCTTCCTTTTTCAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATAC 
ATATTTGAATGTATTTAGAAAAATAAACAAATAGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGTATGCGGTGT 
GAAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGAAATTGTAAACGTTAATATTTTGTTAAAATTCGCGT 
TAAATATTTGTTAAATCAGCTCATTTTTTAACCAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGACC 
GAGATAGGGTTGAGTGTTGTTCCAGTTTGGAACAAGAGTCCACTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAA 
AACCGTCTATCAGGGCGATGGCCCACTACGTGAACCATCACCCAAATCAAGTTTTTTGCGGTCGAGGTGCCGTAAAGCTC 
TAAATCGGAACCCTAAAGGGAGCCCCCGATTTAGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGGAAG 
AAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGCAAGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAA 



SEQ ID NO: 72 - pAN336 

TGCGCCGCTACAGGGCGCGTCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGGCCTCTTCGCT 
ATTACGCCAGCTGGCGAAAGGGGGATGTGCTGCAAGGCGATTAAGTTGGGTAACGCCAGGGTTTTCCCAGTCACGACGTT 
GTAAAACGACGGCCAGTGAATTGTAATACGACTCACTATAGGGCGAATTGGGCCCGACGTCGCATGCACCAGGCTTCTCA 
GGCGCTGACTTAGAAAACCTCTTGAATGAAGCTGCGCTTGTAGCGGCTCGTCAAAACAAGAAAAAAATCGATGCGCGTGA 
TATTGACGAAGCGACGGACCGTGTAATTGCCGGACCCGCTAAGAAGAGCCGCGTTATCTCCAAGAAAGAACGCAATATCG 
TGGCTTATCACGAAGGCGGACACACCGTTATCGGTCTCGTTTTAGATGAGGCAGATATGGTTCATAAAGTAACGATTGTT 
CCTCGGGGCCAGGCTGGCGGTTATGCTGTTATGCTGCCAAGAGAAGACCGTTATTTCCAAACAAAGCCGGAGCTGCTTGA 
TAAAATTGTCGGCCTCTTGGGCGGACGTGTTGCTGAAGAGATTATCTTCGGTGAAGTCAGCACAGGGGCGCACAATGACT 
TCCAGCGTGCGACGAATATTGCAAGACGAATGGTTACAGAATTCGGTATGTCAGAAAAACTGGGACCGTTGCAATTTGGA 
CAGTCTCAGGGCGGTCAGGTATTCTTAGGCCGTGATTTCAACAACGAACAGAACTACAGTGATCAAATCGCTTACGAAAT 
TGATCAGGAAATTCAGCGCATCATCAAAGAATGTTATGAGCGTGCGAAACAAATCCTGACTGAAAATCGTGACAAGCTTG 
AATTGATTGCCCAAACGCTTCTGAAAGTTGAAACGCTTGACGCTGAACAAATCAAACACCTTATCGATCATGGAACATTA 
CCTGAGCGTAATTTCTCAGATGATGAAAAGAACGATGATGTGAAAGTAAACATTCTGACAAAAACAGAAGAAAAGAAAGA 
CGATACGAAAGAGTAATTCGCTTTCTTTCTAAAAAAACTGCCGGCTGACGCTGGCAGTTTTTTTATGTAAATGATTGGCT 
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CAGCTGCGGCTTTTACAATCATCCAATTCTGGTATCGATTTGTTTACAAATGAGCCGCTGATCGTGTATGGTATTGTAGA 
ATGTTTGTAAAAAGTAAAGTAGAGAAACTATTCAAAAGTGGTGATAGAGGTTGTTACTGGTTATCGATGTGGGGAACACC 
CTGCAGCTCGAGTGAAATACCGCACAGATGCGTAAGGAGAAAATACCGCATCAGGCGATAAACCCAGCGAACCATTTGAG 
GTGATAGGTAAGATTATACCGAGGTATGAAAACGAGAATTGGACCTTTACAGAATTACTCTATGAAGCGCCATATTTAAA 
AAGCTACCAAGACGAAGAGGATGAAGAGGATGAGGAGGCAGATTGCCTTGAATATATTGACAATACTGATAAGATAATAT 
ATCTTTTATATAGAAGATATCGCCGTATGTAAGGATTTCAGGGGGCAAGGCATAGGCAGCGCGCTTATCAATATATCTAT 
AGAATGGGCAAAGCATAAAAACTTGCATGGACTAATGCTTGAAACCCAGGACAATAACCTTATAGCTTGTAAATTCTATC 
ATAATTGTGGTTTCAAAATCGGCTCCGTCGATACTATGTTATACGCCAACTTTCAAAACAACTTTGAAAAAGCTGTTTTC 
TGGTATTTAAGGTTTTAGAATGCAAGGAACAGTGAATTGGAGTTCGTCTTGTTATAATTAGCTTCTTGGGGTATCTTTAA 
ATACTGTAGAAAAGAGGAAGGAAATAATAAATGGCTAAAATGAGAATATCACCGGAATTGAAAAAACTGATCGAAAAATA 
CCGCTGCGTAAAAGATACGGAAGGAATGTCTCCTGCTAAGGTATATAAGCTGGTGGGAGAAAATGAAAACCTATATTTAA 
AAATGACGGACAGCCGGTATAAAGGGACCACCTATGATGTGGAACGGGAAAAGGACATGATGCTATGGCTGGAAGGAAAG 
CTGCCTGTTCCAAAGGTCCTGCACTTTGAACGGCATGATGGCTGGAGCAATCTGCTCATGAGTGAGGCCGATGGCGTCCT 
TTGCTCGGAAGAGTATGAAGATGAACAAAGCCCTGAAAAGATTATCGAGCTGTATGCGGAGTGCATCAGGCTCTTTCACT 
CCATCGACATATCGGATTGTCCCTATACGAATAGCTTAGACAGCCGCTTAGCCGAATTGGATTACTTACTGAATAACGAT 
CTGGCCGATGTGGATTGCGAAAACTGGGAAGAAGACACTCCATTTAAAGATCCGCGCGAGCTGTATGATTTTTTAAAGAC 
GGAAAAGCCCGAAGAGGAACTTGTCTTTTCCCACGGCGACCTGGGAGACAGCAACATCTTTGTGAAAGATGGCAAAGTAA 
GTGGCTTTATTGATCTTGGGAGAAGCGGCAGGGCGGACAAGTGGTATGACATTGCCTTCTGCGTCCGGTCGATCAGGGAG 
GATATCGGGGAAGAACAGTATGTCGAGCTATTTTTTGACTTACTGGGGATCAAGCCTGATTGGGAGAAAATAAAATATTA 
TATTTTACTGGATGAATTGTTTTAGTACCTAGATTTAGATGTCTAAAAAGCTTTAACTACAAGCTTTTTAGACATCTAAT 
CTTTTCTGAAGTACATCCGCAACTGTCCATACTCTGATGTTTTATATCTTTTCTAAAAGTTCGCTAGATAGGGGTCCCGA 
GCGCCTACGAGGAATTTGTATCGCCATTCGCCATTCAGGCTGCGCAACTGTTGGGAAGGGCGATCGGTGCGGTCGACTGG 
CAGGCAAAACAGGACCCAAGGTCATTGCGACAGGAGGCCTGGCGCCGCTCATTGCGAACGAATCAGATTGTATAGACATC 
GTTGATCCATTCTTAACCCTAAAAGGGCTGGAATTGATTTATGAAAGAAACCGCGTAGGAAGTGTATAGGAGGTTTAGTA 
ATGGATTATTTAGTAAAAGCACTTGCGTATGACGGAAAAGTTCGGGCTTATGCAGCGAGAACGACTGATATGGTAAATGA 
GGGGCAGAGACGCCATGGTACGTGGCCGACAGCATCCGCTGCACTAGGCCGTACAATGACAGCTTCACTTATGCTCGGCG 
CTATGCTGAAGGGCGATGATAAGCTGACCGTGAAAATCGAGGGCGGAGGTCCGATCGGAGCTATTGTAGCTGATGCCAAT 
GCCAAAGGAGAAGTCAGAGCCTATGTCTCTAACCCGCAAGTTCATTTTGATTTAAATGAACAAGGTAAGCTTGATGTCAG 
ACGTGCGGTTGGAACAAACGGAACGTTAAGTGTCGTAAAAGATTTAGGTTTGCGCGAGTTCTTCACAGGACAAGTAGAAA 
TCGTTTCAGGAGAATTAGGAGATGATTTTACTTACTATCTTGTGTCATCTGAGCAGGTTCCTTCATCAGTGGGCGTAGGT 
. GTGCTCGTAAATCCTGACAATACCATTCTTGCGGCAGGGGGCTTTATTATTCAGCTGATGCCGGGAACAGATGATGAAAC 
AATCACAAAAATTGAACAGCGTCTATCTCAAGTAGAGCCGATTTCTAAGCTCATCCAAAAAGGGCTGACACCAGAAGAAA 
TTTTAGAAGAAGTCCTAGGCGAGAAACCTGAGATTTTGGAAACGATGCCTGTCAGATTCCATTGCCCTTGTTCAAAAGAA 
CGGTTCGAAACAGCCATTTTAGGACTAGGCAAAAAAGAAATTCAAGATATGATAGAAGAAGATGGACAAGCCGAAGCAGT 
ATGCCATTTTTGTAATGAAAAGTACTTATTTACAAAAGAAGAGCTGGAAGGGCTTCGTGACCAAACTACCCGCTAAGCTC 
TTTAGCGGGTTTTTAATTTGAGAAAAGGGGCTGAAAGCAGGTTTGAAATCAAGAACAATCTGGACGCGTTGGATGCATAG 
CTTGAGTATTCTATAGTGTCACCTAAATAGCTTGGCGTAATCATGGTCATAGCTGTTTCCTGTGTGAAATTGTTATCCGC 
TCACAATTCCACACAACATACGAGCCGGAAGCATAAAGTGTAAAGCCTGGGGTGCCTAATGAGTGAGCTAACTCACATTA 
ATTGCGTTGCGCTCACTGCCCGCTTTCCAGTCGGGAAACCTGTCGTGCCAGCTGCATTAATGAATCGGCCAACGCGCGGG 

GAGAGGCGGTTTGCGTATTGGGCGCTCTTCCGCTTCCTCGCTCACTGACTCGCTGCGCTCGGTCGTTCGGCTGCGGCGAG 
CGGTATCAGCTCACTCAAAGGCGGTAATACGGTTATCCACAGAATCAGGGGATAACGCAGGAAAGAACATGTGAGCAAAA 
GGCCAGCAAAAGGCCAGGAACCGTAAAAAGGCCGCGTTGCTGGCGTTTTTCGATAGGCTCCGCCCCCCTGACGAGCATCA 
CAAAAATCGACGCTCAAGTCAGAGGTGGCGAAACCCGACAGGACTATAAAGATACCAGGCGTTTCCCCCTGGAAGCTCCC 
TCGTGCGCTCTCCTGTTCCGACCCTGCCGCTTACCGGATACCTGTCCGCCTTTCTCCCTTCGGGAAGCGTGGCGCTTTCT 
CATAGCTCACGCTGTAGGTATCTCAGTTCGGTGTAGGTCGTTCGCTCCAAGCTGGGCTGTGTGCACGAACCCCCCGTTCA 
GCCCGACCGCTGCGCCTTATCCGGTAACTATCGTCTTGAGTCCAACCCGGTAAGACACGACTTATCGCCACTGGCAGCAG 
CCACTGGTAACAGGATTAGCAGAGCGAGGTATGTAGGCGGTGCTACAGAGTTCTTGAAGTGGTGGCCTAACTACGGCTAC 
ACTAGAAGGACAGTATTTGGTATCTGCGCTCTGCTGAAGCCAGTTACCTTCGGAAAAAGAGTTGGTAGCTCTTGATCCGG 
CAAACAAACCACCGCTGGTAGCGGTGGTTTTTTTGTTTGC^U^GCAGCAGATTACGCGC^GAAAAAAAGGATCTCAAGAAG 
ATCCTTTGATCTTTTCTACGGGGTCTGACGCTCAGTGGAACGAAAACTCACGTTAAGGGATTTTGGTCATGAGATTATCA 
AAAAGGATCTTCACCTAGATCCTTTTAAATTAAAAATGAAGTTTTAAATCAATCTAAAGTATATATGAGTAAACTTGGTC 
TGACAGTTACCAATGCTTAATCAGTGAGGCACCTATCTCAGCGATCTGTCTATTTCGTTCATCCATAGTTGCCTGACTCC 
CCGTCGTGTAGATAACTACGATACGGGAGGGCTTACCATCTGGCCCCAGTGCTGCAATGATACCGCGAGACCCACGCTCA 
CCGGCTCCAGATTTATCAGCAATAAACCAGCCAGCCGGAAGGGCCGAGCGCAGAAGTGGTCCTGCAACTTTATCCGCCTC 
CATCCAGTCTATTAATTGTTGCCGGGAAGCTAGAGTAAGTAGTTCGCCAGTTAATAGTTTGCGCAACGTTGTTGGCATTG 
CTACAGGCATCGTGGTGTCACGCTCGTCGTTTGGTATGGCTTCATTCAGCTCCGGTTCCCAACGATCAAGGCGAGTTACA 
TGATCCCCCATGTTGTGCAAAAAAGCGGTTAGCTCCTTCGGTCCTCCGATCGTTGTCAGAAGTAAGTTGGCCGCAGTGTT 
ATCACTCATGGTTATGGCAGCACTGCATAATTCTCTTACTGTCATGCCATCCGTAAGATGCTTTTCTGTGACTGGTGAGT 
ACTCAACCAAGTCATTCTGAGAATACCGCGCCCGGCGACCGAGTTGCTCTTGCCCGGCGTCAATACGGGATAATAGTGTA 
TGACATAGCAGAACTTTAAAAGTGCTCATCATTGGAAAACGTTCTTCGGGGCGAAAACTCTCAAGGATCTTACCGCTGTT 
GAGATCCAGTTCGATGTAACCCACTCGTGCACCCAACTGATCTTCAGCATCTTTTACTTTCACCAGCGTTTCTGGGTGAG 
CAAAAACAGGAAGGCAAAATGCCGCAAAAAAGGGAATAAGGGCGACACGGAAATGTTGAATACTCATACTCTTCCTTTTT 
CAATATTATTGAAGCATTTATCAGGGTTATTGTCTCATGAGCGGATACATATTTGAATGTATTTAGAAAAATAAACAAAT 
AGGGGTTCCGCGCACATTTCCCCGAAAAGTGCCACCTGTATGCGGTGTGAAATACCGCACAGATGCGTAAGGAGAAAATA 
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CCGCATCAGGCGAAATTGTAAACGTTAATATTTTGTTAAAATTCGCGTTAAATATTTGTTAAATCAGCTCATTTTTTAAC 
CAATAGGCCGAAATCGGCAAAATCCCTTATAAATCAAAAGAATAGACCGAGATAGGGTTGAGTGTTGTTCCAGTTTGGAA 
CAAGAGTCCACTATTAAAGAACGTGGACTCCAACGTCAAAGGGCGAAAAACCGTCTATCAGGGCGATGGCCCACTACGTG 
AACCATCACCCAAATCAAGTTTTTTGCGGTCGAGGTGCCGTAAAGCTCTAAATCGGAACCCTAAAGGGAGCCCCCGATTT 
AGAGCTTGACGGGGAAAGCCGGCGAACGTGGCGAGAAAGGAAGGGAAGAAAGCGAAAGGAGCGGGCGCTAGGGCGCTGGC 
AAGTGTAGCGGTCACGCTGCGCGTAACCACCACACCCGCCGCGCTTAA 



SEQ ID NO: 73 - pAN341 and pAN342 

TTGCGGCCGCTTCGAACTGTTATAAAAAAAGGATCAATTTTGAACTCTCTCCCAAAGTTGATCCCTTAACGATTTAGAAA 

TCCCTTTGAGAATGTTTATATACATTCAAGGTAACCAGCCAACTAATGACAATGATTCCTGAAAAAAGTAATAACAAATT 

ACTATACAGATAAGTTGACTGATCAACTTCCATAGGTAACAACCTTTGATCAAGTAAGGGTATGGATAATAAACCACCTA 

CAATTGCAATACCTGTTCCCTCTGATAAAAAGCTGGTAAAGTTAAGCAAACTCATTCCAGCACCAGCTTCCTGCTGTTTC 

AAGCTACTTGAAACAATTGTTGATATAACTGTTTTGGTGAACGAAAGCCCACCTAAAACAAATACGATTATAATTGTCAT 

GAACCATGATGTTGTTTCTAAAAGAAAGGAAGCAGTTAAAAAGCTAACAGAAAGAAATGTAACTCCGATGTTTAACACGT 

ATAAAGGACCTCTTCTATCAACAAGTATCCCACCAATGTAGCCGAAAATAATGACACTCATTGTTCCAGGGAAAATAATT 

ACACTTCCGATTTCGGCAGTACTTAGCTGGTGAACATCTTTCATCATATAAGGAACCATAGAGACAAACCCTGCTACTGT 

TCCAAATATAATTCCCCCACAAAGAACTCCAATCATAAAAGGTATATTTTTCCCTAATCCGGGATCAACAAAAGGATCTG 

TTACTTTCCTGATATGTTTTACAAATATCAGGAATGACAGCACGCTAACGATAAGAAAAGPiAATGCTATATGATGTTGTA 

AACAACATAAAAAATACAATGCCTACAGACATTAGTATAATTCCTTTGATATCAAAATGACCTTTTATCCTTACTTCTTT 

CTTTAATAATTTCATAAGAAACGGAACAGTGATAATTGTTATCATAGGAATGAGTAGAAGATAGGACCAATGAATATAAT 

GGGCTATCATTCCACCAATCGCTGGACCGACTCCTTCTCCCATGGCTACTATCGATCCAATAAGACCAAATGCTTTACCC 

CTATTTTCCTTTGGAATATAGCGCGCAACTACAACCATTACGAGTGCTGGAAATGCAGCTGCACCAGCCCCTTGAATAAA 

ACGAGCCATAATAAGTAAGGAAAAGAAAGAATGGCCAACAAACCCAATTACCGACCCGAAACAATTTATTATAATTCCAA 

ATAGGAGTAACCTTTTGATGCCTAATTGATCAGATAGCTTTCCATATACAGCTGTTCCAATGGAAAAGGTTAACATAAAG 

GCTGTGTTCACCCAGTTTGTACTCGCAGGTGGTTTATTAAAATCATTTGCAATATCAGGTAATGAGACGTTCAAAACCAT 

TTCATTTAATACGCTAAAAAAAGATAAAATGCAAAGCCAAATTAAAATTTGGTTGTGTCGTAAATTCGATTGTGAATAGG 

ATGTATTCACATTTCACCCTCCAATAATGAGGGCAGACGTAGTTTATAGGGTTAATGATACGCTTCCCTCTTTTAATTGA 

ACCCTGTTACATTCATTACACTTCATAATTAATTCCTCCTAAACTTGATTAAAACATTTTACCACATATAAACTAAGTTT 

TAAATTCAGTATTTCATCACTTATACAACAATATGGCCCGTTTGTTGAACTACTCTTTAATAAAATAATTTTTCCGTTCC 

CAATTCCACATTGCAATAATAGAAAATCCATCTTCATCGGCTTTTTGGTCATCATCTGTATGAATCAAATCGCCTTCTTC 

TGTGTCATCAAGGTTTAATTTTTTATGTATTTCTTTTAACAAACCACCATAGGAGATTAACCTTTTACGGTGTAAACCTT 

CCTCCAAATCAGACAAACGTTTCAAATTCTTTTCTTCATCATCGGTCATAAAATCCGTATCCTTTACAGGATATTTTGCA 

GTTTCGTCAATTGCCGATTGTATATCCGATTTATATTTATTTTTCGGTCGAATCATTTGAACTTTTACATTTGGATCATA 

GTCTAATTTCATTGCCTTTTTCCAAAATTGAATCCATTGTTTTTGATTCACGTAGTTTTCTGTATTCTTAAAATAAGTTG 

GTTCCACACATACCAATACATGCATGTGCTGATTATAAGAATTATCTTTATTATTTATTGTCACTTCCGTTGCACGCATA 

AAACCAACAAGATTTTTATTAATTTTTTTATATTGCATCATTCGGCGAAATCCTTGAGCCATATCTGACAAACTCTTATT 

TAATTCTTCGCCATCATAAACATTTTTAACTGTTAATGTGAGAAACAACCAACGAACTGTTGGCTTTTGTTTAATAACTT 

CAGCAACAACCTTTTGTGACTGAATGCCATGTTTCATTGCTCTCCTCCAGTTGCACATTGGACAAAGCCTGGATTTACAA 

AACCACACTCGATACAACTTTCTTTCGCCTGTTTCACGATTTTGTTTATACTCTAATATTTCAGCACAATCTTTTACTCT 

TTCAGCCTTTTTAAATTCAAGAATATGCAGAAGTTCAAAGTAATCAACATTAGCGATTTTCTTTTCTCTCCATGGGGAAT 

TGGAATTCTCAGTCGCTCCAGTTGCAAACGATTCTGGATAGTTTGCCGGATTTGCGATAGAACCAGGCCCGCCGGGAATA 

AAGAGATCCGTATTCCCCGCTGAAAACTCAGGGAAAATATCGGCCGAACGCCAGGCATTGACCATGTCTCTGTACCATTC 

ATCAAGTCCAGAGCCCCCTCCCCATGAGTTATTGACAACATCAGGAGCCATTTCCGGGTGGGGATTTCCTTCCGCGTCCT 

TTGGTGCTAAAACCCATTCACCAGCTTCCAAAATGTCAGCATCAGTGCCGCCATCTTCAGAGAACGCTTTAACAGCAATC 

CATTTTGCGCCAGGTGCTACACCGATTTGATTTGTTCCATCAGGTTCAGAGCCCACCATCGTGCCTGTCACGTGGGTTCC 

ATGAGCCAAATCATCATAAGGGCTTGCCTCGCCTGCTACGGCATCATACCAGTTCATTTCATTTTCAGGCTCATTAGGAT 

TTTCCGGATTATATCCGCGATATTTCTCTTTTAATGCCGGATGATTCCATTCCACCCCGGTATCAATGGACGCAACAACC 

GTGCCAGTTCCATCATATCCAAGTGCCCAAGCTTTTGGGGCATCGATTTGGTCTACATTCCATTCCACACCGTCAGTTGC 

TTTAATAGCTTTCTGTGCTTTTTTCATATTAAATGGGGAGGATGACTTAAAAAGCTGCCGTTTCTCATTAGGAAGCACCT 

TTTCCACTTCGGGAAACTGCACCACTTTTTCCATAACCTCTTTTGAGGCATGAACAGCAATCCCGTTCACCACATAATAA 

GAATGAATTTGGTCTGCATTTCCTTTATCTTTCTGGGTGTTCAAGTATTTTAGGACATCTTGCTGGGATTCATCGGCTGT 

GACTTTTAAAGATGACACAACAGCAGAACGCTTTTGATATTCCGTCTTAGCGGCAGACAGCTTCTTCGATTTCGCTTTTT 

TAACAGCCGCTTTTGCCGCTTTTTCTGGGTACCCCTATGTATTACTAGAAAATAACATAGTAAAACGGACATCACTCCGT 

TTCAATGGAGGTGATGTCCGTTTTTCATTACAACAAATTACTTATCTATTTGTAATGCTGCTCTTGGACCCGGGATCCGC 

AAGTGCTTTTACTAAATAATCCATTACTAAACCTCCTATACACTTCCTACGCGGTTTCTTTCATAAATCAATTCCAGCCC 

TTTTAGGGTTAAGAATGGATCAACGATGTCTATACAATCTGATTCGTTCGCAATGAGCGGCGCCAGGCCTCCTGTCGCAA 

TGACCTTGGGTCCTGTTTTGCCTGCCATTTCATTCGCTTAACGATTCCTTCCACTTGGCCGACATAGCCAAATAAAATTC 

CAGATTGCATCGCGCTAACAGTGTTTTTTCCGATAATATTGTCGGGCCGGGTGATTTCGATACGAGGAAGCTTTGCTGCA 

CGCGAGTAAAGCGCCTCTGTCGAAATTGTAATCCCAGGGGCAATCGCCCCGCCCATGTATTGTTTGTTTTCATCAATATA 

GCAGTACGTTGTGGCGGTTCCGAAATCGACAACAATTAATGGATTGCCGTACAAGTGTATCGCAGCGACAGCATTTACGA 

TTCTGTCTGCCCCTACTTCTTTCGGATTGTCATATTTTATATTTAAACCGGTTTTCATACCTGGACCAACAATTTGAGGC 

TCGATATGAAAGTATTTTGTGCACATTCTTTCTAACGCAAACATGATTGGCGGCACTACTGACGAAATAATAATGCCATC 

TATCTGTTCAAACATAAGCCCGGAGTGATCAAATAAGGAGCGCAAAATCATCCCAAACTCATCTTCTGTTTTATGCCTGC 
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TTGTTTCTATACGCCAGTGATATTCTAATTTTCCATCATGATATACACCAAGTACAGTATTGGTGTTCCCCACATCGATA 

ACCAGTAACAACCTCTATCACCACTTTTGAATAGTTTCTCTAGAACAGGCGGGGTTGCCCCCGCCTGTAATTAAATTATT 

ACACACCCTGTAGGGAAAGTCAATACCTTTTTGTAAAATTTTTACACAGCGTGGATCTCTTCTAGGGACACCTCTTTGTA 

CCCCTCAAGGGAGAAATATTGGCGGTACTGAGCACAGTTTTGGTTGGTGGACAGTGAACCATAGCTGTCGTCAATAGCCT 

CGAGTTATGGCAGTTGGTTAAAAGGAAACAAAAAGACCGTTTTCACACAAAACGGTCTTTTTCGATTTCTTTTTACAGTC 

ACAGCCACTTTTGCAAAAACCGGACAGCTTCATGCCTTATAACTGCTGTTTCGGTCGACAAGCTTCGCGAAGCGGCCGCA 

AAATTCACTGGCCGTCGTTTTACAACGTCGTGACTGGGAAAACCCTGGCGTTACCCAACTTAATCGCCTTGCAGCACATC 

CCCCTTTCGCCAGCTGGCGTAATAGCGAAGAGGCCCGCACCGATCGCCCTTCCCAACAGTTGCGCAGCCTGAATGGCGAA 

TGGCGCCTGATGCGGTATTTTCTCCTTACGCATCTGTGCGGTATTTCACACCGCATATGGTGCACTCTCAGTACAATCTG 

CTCTGATGCCGCATAGTTAAGCCAGCCCCGACACCCGCCAACACCCGCTGACTATGCTTGTAAACCGTTTTGTGAAAAAA 

TTTTTAAAATAAAAAAGGGGACCTCTAGGGTCCCCAATTAATTAGTAATATAATCTATTAAAGGTCATTCAAAAGGTCAT 

CCACCGGATCAGCTTAGTAAAGCCCTCGCTAGATTTTAATGCGGATGTTGCGATTACTTCGCCAACTATTGCGATAACAA 

GAAAAAGCCAGCCTTTCATGATATATCTCCCAATTTGTGTAGGGCTTATTATGCACGCTTAAAAATAATAAAAGCAGACT 

TGACCTGATAGTTTGGCTGTGAGCAATTATGTGCTTAGTGCATCTAACGCTTGAGTTAAGCCGCGCCGCGAAGCGGCGTC 

GGCTTGAACGAATTGTTAGACATTATTTGCCGACTACCTTGGTGATCTCGCCTTTCACGTAGTGGACAAATTCTTCCAAC 

TGATCTGCGCGCGAGGCCAAGCGATCTTCTTCTTGTCCAAGATAAGCCTGTCTAGCTTCAAGTATGACGGGCTGATACTG 

GGCCGGCAGGCGCTCCATTGCCCAGTCGGCAGCGACATCCTTCGGCGCGATTTTGCCGGTTACTGCGCTGTACCAAATGC 

GGGACAACGTAAGCACTACATTTCGCTCATCGCCAGCCCAGTCGGGCGGCGAGTTCCATAGCGTTAAGGTTTCATTTAGC 

GCCTCAAATAGATCCTGTTCAGGAACCGGATCAAAGAGTTCCTCCGCCGCTGGACCTACCAAGGCAACGCTATGTTCTCT 

TGCTTTTGTCAGCAAGATAGCCAGATCAATGTCGATCGTGGCTGGCTCGAAGATACCTGCAAGAATGTCATTGCGCTGCC 

ATTCTCCAAATTGCAGTTCGCGCTTAGCTGGATAACGCCACGGAATGATGTCGTCGTGCACAACAATGGTGACTTCTACA 

GCGCGGAGAATCTCGCTCTCTCCAGGGGAAGCCGAAGTTTCCAAAAGGTCGTTGATCAAAGCTCGCCGCGTTGTTTCATC 

AAGCCTTACGGTCACCGTAACCAGCAAATCAATATCACTGTGTGGCTTCAGGCCGCCATCCACTGCGGAGCCGTACAAAT 

GTACGGCCAGCAACGTCGGTTCGAGATGGCGCTCGATGACGCCAACTACCTCTGATAGTTGAGTCGATACTTCGGCGATC 

ACCGCTTCCCTCATGATGTTTAACTTTGTTTTAGGGCGACTGCCCTGCTGCGTAACATCGTTGCTGCTCCATAACATCAA 

ACATCGACCCACGGCGTAACGCGCTTGCTGCTTGGATGCCCGAGGCATAGACTGTACCCCAAAAAAACAGTCATAACAAG 

CCATGAAAACCGCCACTGCGCCGTTACCACCGCTGCGTTCGGTCAAGGTTCTGGACCAGTTGCGTGAGCGCATACGCTAC 

TTGCATTACAGCTTACGAACCGAACAGGCTTATGTCCACTGGGTTCGTGCCTTCATCCGTTTCCACGGTGTGCGTCACCC 

GGCAACCTTGGGCAGCAGCGAAGTCGAGGCATTTCTGTCCTGGCTGGCGAACGAGCGCAAGGTTTCGGTCTCCACGCATC 

GTCAGGCATTGGCGGCCTTGCTGTTCTTCTACGGCAAGGTGCTGTGCACGGATCTGCCCTGGCTTCAGGAGATCGGAAGA 

CCTCGGCCGTCGCGGCGCTTGCCGGTGGTGCTGACCCCGGATGAAGTGGTTCGCATCCTCGGTTTTCTGGAAGGCGAGCA 

TCGTTTGTTCGCCCAGCTTCTGTATGGAACGGGCATGCGGATCAGTGAGGGTTTGCAACTGCGGGTCAAGGATCTGGATT 

TCGATCACGGCACGATCATCGTGCGGGAGGGCAAGGGCTCCAAGGATCGGGCCTTGATGTTACCCGAGAGCTTGGCACCC 

AGCCTGCGCGAGCAGGGGAATTGATCCGGTGGATGACCTTTTGAATGACCTTTAATAGATTATATTACTAATTAATTGGG 

GACCCTAGAGGTCCCCTTTTTTATTTTAAAAATTTTTTCACAAAACGGTTTACAAGCATAACGGGTTTTGCTGCCCGCAA 

ACGGGCTGTTCTGGTGTTGCTAGTTTGTTATCAGAATCGCAGATCCGGCTTCAGGTTTGCCGGCTGAAAGCGCTATTTCT 

TCCAGAATTGCCATGATTTTTTCCCCACGGGAGGCGTCACTGGCTCCCGTGTTGTCGGCAGCTTTGATTCGATAAGCAGC 

ATCGCCTGTTTCAGGCTGTCTATGTGTGACTGTTGAGCTGTAACAAGTTGTCTCAGGTGTTCAATTTCATGTTCTAGTTG 

CTTTGTTTTACTGGTTTCACCTGTTCTATTAGGTGTTACATGCTGTTCATCTGTTACATTGTCGATCTGTTCATGGTGAA 

CAGCTTTAAATGCACCAAAAACTCGTAAAAGCTCTGATGTATCTATCTTTTTTACACCGTTTTCATCTGTGCATATGGAC 

AGTTTTCCCTTTGATATCTAACGGTGAACAGTTGTTCTACTTTTGTTTGTTAGTCTTGATGCTTCACTGATAGATACAAG 

AGCCATAAGAACCTCAGATCCTTCCGTATTTAGCCAGTATGTTCTCTAGTGTGGTTCGTTGTTTTTGCGTGAGCCATGAG 

AACGAACCATTGAGATCATGCTTACTTTGCATGTCACTCAAAAATTTTGCCTCAAAACTGGTGAGCTGAATTTTTGCAGT 

TAAAGCATCGTGTAGTGTTTTTCTTAGTCCGTTACGTAGGTAGGAATCTGATGTAATGGTTGTTGGTATTTTGTCACCAT 

TCATTTTTATCTGGTTGTTCTCAAGTTCGGTTACGAGATCCATTTGTCTATCTAGTTCAACTTGGAAAATCAACGTATCA 

GTCGGGCGGCCTCGCTTATCAACCACCAATTTCATATTGCTGTAAGTGTTTAAATCTTTACTTATTGGTTTCAAAACCCA 

TTGGTTAAGCCTTTTAAACTCATGGTAGTTATTTTCAAGCATTAACATGAACTTAAATTCATCAAGGCTAATCTCTATAT 

TTGCCTTGTGAGTTTTCTTTTGTGTTAGTTCTTTTAATAACCACTCATAAATCCTCATAGAGTATTTGTTTTCAAAAGAC 

TTAACATGTTCCAGATTATATTTTATGAATTTTTTTAACTGGAAAAGATAAGGCAATATCTCTTCACTAAAAACTAATTC 

TAATTTTTCGCTTGAGAACTTGGCATAGTTTGTCCACTGGAAAATCTCAAAGCCTTTAACCAAAGGATTCCTGATTTCCA 

cagttctcgtcatcagctctctggttgctttagctaatacaccataagcattttccctactgatgttcatcatctgagcg 
tattggttataagtgaacgataccgtccgttctttccttgtagggttttcaatcgtggggttgagtagtgccacacagca 
taaaattagcttggtttcatgctccgttaagtcatagcgactaatcgctagttcatttgctttgaaaacaactaattcag 
acatacatctcaattggtctaggtgattttaatcactataccaattgagatgggctagtcaatgataattactagtcctt 
ttcctttgagttgtgggtatctgtaaattctgctagacctttgctggaaaacttgtaaattctgctagaccctctgtaaa 
ttccgctagacctttgtgtgttttttttgtttatattcaagtggttataatttatagaataaagaaagaataaaaaaaga 
taaaaagaatagatcccagccctgtgtataactcactactttagtcagttccgcagtattacaaaaggatgtcgcaaacg 
ctgtttgctcctctac^u^cagaccttaaaaccctaaaggcttaagtagt^ccctcgcaagctcgggcaaatcgctgaa 

TATTCCTTTTGTCTCCGACCATCAGGCACCTGAGTCGCTGTCTTTTTCGTGACATTCAGTTCGCTGCGCTCACGGCTCTG 
GCAGTGAATGGGGGTAAATGGCACTACAGGCGCCTTTTATGGATTCATGCAAGGAAACTACCCATAATACAAGAAAAGCC 
CGTCACGGGCTTCTCAGGGCGTTTTATGGCGGGTCTGCTATGTGGTGCTATCTGACTTTTTGCTGTTCAGCAGTTCCTGC 

cctctgattttccagtctgaccacttcggattatcccgtgacaggtcattcagactggctaatgcacccagtaaggcagc 
ggtatcatcaacaggcttacccgtcttactgtcaac 
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SEQ ID NO: 74 - pOTP71 

CCATCGAATGGCCAGATGATTAATTCCTAATTTTTGTTGACACTCTAT CATTGATAGAGTTATTTTACCACTCCCTA 

TCAGTGATAGAGAAAAGTGAAATGAATAGTTCGACAAAAATCTAGAAAAGGAGGAATTTAAaTGTTACTGGTTATCG 

ATGTGGGGAACACCAATACTGTACTTGGTGTATATCATGATGGAAAATTAGAATATCACTGGCGTATAGAAACAAGC 

AGGCATAAAACAGAAGATGAGTTTGGGATGATTTTGCGCTCCTTATTTGATCACTCCGGGCTTATGTTTGAACAGAT 

AGATGGCATTATTATTTCGTCAGTAGTGCCGCCAATCATGTTTGCGTTAGAAAGAATGTGCACAAAATACTTTCATA 

TCGAGCCTCAAATTGTTGGTCCAGGTATGAAAACCGGTTTAAATATAAAATATGACAATCCGAAAGAAGTAGGGGCA 

GACAGAATCGTAAATGCTGTCGCTGCGATACACTTGTACGGCAATCCATTAATTGTTGTCGATTTCGGAACCGCCAC 

AACGTACTGCTATATTGATGAAAACAAACAATACATGGGCGGGGCGATTGCCCCTGGGATTACAATTTCGACAGAGG 

CGCTTTACTCGCGTGCAGCAAAGCTTCCTCGTATCGAAATCACCCGGCCCGACAATATTATCGGAAAAAACACTGTT 

AGCGCGATGCAATCTGGAATTTTATTTGGCTATGTCGGCCAAGTGGAAGGAATCGTTAAGCGAATGAAATGGCAGGC 

AAAACAGGACcCCAAGGTCATTGCGACAGGAGGCCTGGCGCCGCTCATTGCGAACGAATCAGATTGTATAGACATCG 

TTGATCCATTCTTAACCCTAAAAGGGCTGGAATTGATTTATGAAAGAAACCGCGTAGGAAGTGTATAaGGATCCCTC 

GAGGTCGACCTGCAGGGGGACCATGGTCTCAGCGCTTGGAGCCACCCGCAGTTCGAAAAATAATAAGCTTGACCTGT 

GAAGTGAAAAATGGCGCACATTGTGCGACATTTTTTTTGTCTGCCGTTTACCGCTACTGCGTCACGGATCTCCACGC 

GCCCTGTAGCGGCGCATTAAGCGCGGCGGGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAG 

CGCCCGCTCCTTTCGCTTTCTTCCCTTCCTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGG 

CTCCCTTTAGGGTTCCGATTTAGTGCTTTACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAG 

TGGGCCATCGCCCTGATAGACGGTTTTTCGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCC 

AAACTGGAACAACACTCAACCCTATCTCGGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGG 

TTAAAAAATGAGCTGATTTAACAAAAATTTAACGCGAATTTTAACAAAATATTAACGCTTACAATTTCAGGTGGCAC 

TTTTCGGGGAAATGTGCGCGGAACCCCTATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGAC 

AATAACCCTGATAAATGCTTCAATAATATTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATT 

CCCTTTTTTGCGGCATTTTGCCTTCCTGTTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCA 

GTTGGGTGCACGAGTGGGTTACATCGAACTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAAC 

GTTTTCCAAT GATGAGCACTTTTAAAGTTCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAA 

CTCGGTCGCCGCATACACTATTCTCAGAATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGG 

CATGACAGTAAGAGAATTATGCAGTGCTGCCATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGA 

TCGGAGGACCGAAGGAGCTAACCGCTTTTTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCG 

GAGCTGAATGAAGCCATACCAAACGACGAGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACT 

ATTAACTGGCGAACTACTTACTCTAGCTTCCCGGCAACAATTGATAGACTGGATGGAGGCGGATAAAGTTGCAGGAC 

CACTTCTGCGCTCGGCCCTTCCGGCTGGCTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGCTCTCGCGGT 

ATCATTGCAGCACTGGGGCCAGATGGTAAGCCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTAT 

GGATGAACGAAATAGACAGATCGCTGAGATAGGTGCCTCACTGATTAAGGATTGGTAGGAATTAATGATGTCTCGTT 

TAGATAAAAGTAAAGTGATTAACAGCGCATTAGAGCTGCTTAATGAGGTCGGAATCGAAGGTTTAACAACCCGTAAA 

CTCGCCCAGAAGCTAGGTGTAGAGCAGCCTACATTGTATTGGCATGTAAAAAATAAGCGGGCTTTGCTCGACGCCTT 

AGCCATTGAGATGTTAGATAGGCACCATACTCACTTTTGCCCTTTAGAAGGGGAAAGCTGGCAAGATTTTTTACGTA 

AT AACGCTAAAAGTTTTAGATGTGCTTTACTAAGTCATCGCGATGGAGCAAAAGTACATTTAGGTACACGGCCTACA 

GAAAAACAGTATGAAACTCTCGAAAATCAATTAGCCTTTTTATGCCAACAAGGTTTTTCACTAGAGAATGCATTATA 

TGCACTCAGCGCAGTGGGGCATTTTACTTTAGGTTGCGTATTGGAAGATCAAGAGCATCAAGTCGCTAAAGAAGAAA 

GGGAAACACCTACTACTGATAGTATGCCGCCATTATTACGACAAGCTATCGAATTATTTGATCACCAAGGTGCAGAG 

CCAGCCTTCTTATTCGGCCTTGAATTGATCATATGCGGATTAGAAAAACAACTTAAATGTGAAAGTGGGTCTTAAAA 

GCAGCATAACCTTTTTCCGTGATGGTAACTTCACTAGTTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCA 

TGACCAAAATCCCTTAACGTGAGTTTTCGTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGA 

GATCCTTTTTTTCTGCGCGTAATCTGCTGCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGA 

TCAAGAGCTACCAACTCTTTTTCCGAAGGTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGT 

AGCCGTAGTTAGGCCACCACTTCAAGAACTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTG 

GCTGCTGCCAGTGGCGATAAGTCGTGTCTTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTC 

GGGCTGAACGGGGGGTTCGTGCACACAGCCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTG 

AGCTATGAGAAAGCGCCACGCTTCCCGAAGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGA 

GAGCGCACGAGGGAGCTTCCAGGGGGAAACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGA 

GCGTCGATTTTTGTGATGCTCGTCAGGGGGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCC 

TGGCCTTTTGCTGGCCTTTTGCTCACATGACCCGACA 



SEQ ID NO: 75 - pOTP72 

CCATCGAATGGCCAGATGATTAATTCCTAATTTTTGTTGACACTCTATCATTGATAGAGTTATTTTACCACTCCCTA 
TCAGTGATAGAGAAAAGTGAAATGAATAGTTCGACAAAAATCTAGAAAAGGAGGAATTTAAATGCCAGCTAGGCAAT 
CTTTTACAGATTTGAAAAACCTGGTTTTGTGCGATATAGGCAACACGCGTATCCATTTTGCACAAAACTAT CAGCTC 
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TTTT CAAGC G CT AAAG AAGAT T TAAAGCGTT TGGGTATT CAAAAGGAAAT TTT T T AC AT T AGC GT GAAT GAAGAAAA 
TGAAAAAGCCCTTTTGAATTGTTACCCTAACGCTAAAAATATTGCAGGGTTTTTTCATTTAGAAACCGACTATGTAG 
GGCTTGGGATAGACCGGCAAATGGCGTGTCTGGCGGTAAATAATGGCGTGGTGGTGGATGCCGGGAGTGCGATTACG 
ATAGATTTAATCAAAGAGGGCAAGCATTTAGGAGGGTGTATTTTACCCGGTTTAGCCCAATATATTCATGCGTATAA 
AAAAAGCGCTAAAATTTTAGAGCAACCTTTCAAGGCCTTAGATTCTTTAGAAGTTTTACCTAAAAGCACTAGAGACG 
CTGTGAATTACGGCATGGTTTTGAGCGTCATTGCTTGTATCCAGCATTTAGCCAAAAAT CAAAAAATCTATCTTTGT 
GGGGGCGATGCGAAGTATTTGAGCGCGTTTTTACCCCATTCTGTTTGCAAGGAGCGTTTGGTTTTTGACGGGATGGA 
AATCGCTCTTAAAAAAGCAGGGATACTAGAATGCAAATAAGGATCCCTCGAGGTCGACCTGCAGGGGGACCATGGTC 
TCAGCGCTTGGAGCCACCCGCAGTTCGAAAAATAATAAGCTTGACCTGTGAAGTGAAAAATGGCGCACATTGTGCGA 
CATTTTTTTTGTCTGCCGTTTACCGCTACTGCGTCACGGATCTCCACGCGCCCTGTAGCGGCGCATTAAGCGCGGCG 
GGTGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTC 
CTTTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTT 
TACGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTT 
CGCCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTC 
GGTCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAAT 
TTAACGCGAATTTTAACAAAATATTAACGCTTACAATTTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCT 
ATTTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATA 
TTGAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTG 
TTTTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAA 
CTGGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGT 
TCTGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGA 
ATGACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCT 
GCCATAAGCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTT 
TTTGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACG 
AGCGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCT 
TCCCGGCAACAATTGATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGG 
CTGGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGCTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTA 
AGCCCTCCCGTATCGTAGTTATCTACACGACGGGGPGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAG 
ATAGGTGCCTCACTGATTAAGCATTGGTAGGAATTAATGATGTCTCGTTTAGATAAAAGTAAAGTGATTAACAGCGC 
ATTAGAGCTGCTTAATGAGGTCGGAATCGAAGGTTTAACAACCCGTAAACTCGCCCAGAAGCTAGGTGTAGAGCAGC 
CTACATTGTATTGGCATGTAAAAAATAAGCGGGCTTTGCTCGACGCCTTAGCCATTGAGATGTTAGATAGGCACCAT 
ACTCACTTTTGCCCTTrAGAAGGGGAAAGCTGGCAAGATTTTTTACGTAATAACGCTA?iAAGTTTTAGATGTGCTTT 
ACTAAGTCATCGCGATGGAGCAAAAGTACATTTAGGTACACGGCCTACAGAAAAACAGTATGAAACTCTCGAAAATC 
AATTAGCCTTTTTATGCCAACAAGGTTTTTCACTAGAGAATGCATTATATGCACTCAGCGCAGTGGGGCATTTTACT 
TTAGGTTGCGTATTGGAAGATCAAGAGCATCAAGTCGCTAAAGAAGAAAGGGAAACACCTACTACTGATAGTATGCC 
GCCATTATTACGACAAGCTATCGAATTATTTGATCACCAAGGTGCAGAGCCAGCCTTCTTATTCGGCCTTGAATTGA 
TCATATGCGGATTAGAAAAACAACTTAAATGTGAAAGTGGGTCTTAAAAGCAGCATAACCTTTTTCCGTGATGGTAA 
CTTCACTAGTTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTC 
GTTCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCT 
GCTTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAG 
GTAACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAA 
CTCTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTC 
TTACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAG 
CCCAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGA 
AGGGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAA 
ACGCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGG 
GGGCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACAT 
GACCCGACA 



SEQ ID NO: 76 - pOTP73 

CCATCGAATGGCCAGATGAT TAATTCCTAATTTTTGTTGACACTCTATCATTGATAGAGTTATTTTACCACTCCCTA 
TCAGT GAT AGAGAAAAGTGAAAT GAAT AGTTCGACAAAAATCTAGAAAAGGAGGAATTTAAAT GAT TCTTGAGCTCG 
ACTGTGGAAACTCGCTGATCAAGTGGCGGGTCATCGAGGGGGCGGCGCGGTCGGTCGCCGGTGGCCTTGCGGAGTCC 
GATGATGCCCTGGTCGAACAGTTAACGTCGCAGCAAGCGCTGCCAGTGCGAGCCTGTCGCCTGGTGAGCGTTCGCAG 
CGAGCAGGAAACCTCGCAACTGGTCGCACGGTTGGAGCAGCTGTTCCCGGTTTCGGCGCTGGTTGCATCATCCGGCA 
AGCAGTTGGCGGGTGTGCGCAACGGCTATCTCGATTACCAGCGCCTGGGGCTCGACCGCTGGCTGGCCCTCGTCGCG 
GCTCATCACCTGGCTAAGAAGGCCTGCCTGGTCATTGATCTGGGGACCGCGGTCACCTCTGACCTGGTCGCGGCGGA 
TGGAGTGCATCTGGGGGGCTACATATGCCCGGGCATGACCCTGATGAGAAGCCAGTTGCGCACCCATACCCGACGTA 
TCCGCTACGACGATGCAGAGGCCCGGCGGGCGCTTGCCAGTCTCCAGCCAGGGCAGGCCACGGCCGAGGCGGTTGAG 
CGGGGTTGTCTGCTCATGCTCAGGGGGTTCGTTCGTGAGCAGTACGCCATGGCGTGCGAGCTGCTCGGTCCGGATTG 
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TGAAATATTCCTGACGGGTGGGGATGCCGAACTGGTTCGCGACGAACTGGCTGGCGCCCGGATCATGCCGGACCTGG 
TTTTCGTAGGGCTGGCACTGGCTTGCCCGATTGAGTAAGGATCCCTCGAGGTCGACCTGCAGGGGGACCATGGTCTC 
AGCGCTTGGAGCCACCCGCAGTTCGAAAAATAATAAGCTTGACCTGTGAAGTGAAAAATGGCGCACATTGTGCGACA 
TTTTTTTTGTCTGCCGTTTACCGCTACTGCGTCACGGATCTCCACGCGCCCTGTAGCGGCGCATTAAGCGCGGCGGG 
TGTGGTGGTTACGCGCAGCGTGACCGCTACACTTGCCAGCGCCCTAGCGCCCGCTCCTTTCGCTTTCTTCCCTTCCT 
TTCTCGCCACGTTCGCCGGCTTTCCCCGTCAAGCTCTAAATCGGGGGCTCCCTTTAGGGTTCCGATTTAGTGCTTTA 
CGGCACCTCGACCCCAAAAAACTTGATTAGGGTGATGGTTCACGTAGTGGGCCATCGCCCTGATAGACGGTTTTTCG 
CCCTTTGACGTTGGAGTCCACGTTCTTTAATAGTGGACTCTTGTTCCAAACTGGAACAACACTCAACCCTATCTCGG 
TCTATTCTTTTGATTTATAAGGGATTTTGCCGATTTCGGCCTATTGGTTAAAAAATGAGCTGATTTAACAAAAATTT 
AACGCGAATTTTAACAAAATATTAACGCTTACAATTTCAGGTGGCACTTTTCGGGGAAATGTGCGCGGAACCCCTAT 
TTGTTTATTTTTCTAAATACATTCAAATATGTATCCGCTCATGAGACAATAACCCTGATAAATGCTTCAATAATATT 
GAAAAAGGAAGAGTATGAGTATTCAACATTTCCGTGTCGCCCTTATTCCCTTTTTTGCGGCATTTTGCCTTCCTGTT 
TTTGCTCACCCAGAAACGCTGGTGAAAGTAAAAGATGCTGAAGATCAGTTGGGTGCACGAGTGGGTTACATCGAACT 
GGATCTCAACAGCGGTAAGATCCTTGAGAGTTTTCGCCCCGAAGAACGTTTTCCAATGATGAGCACTTTTAAAGTTC 
TGCTATGTGGCGCGGTATTATCCCGTATTGACGCCGGGCAAGAGCAACTCGGTCGCCGCATACACTATTCTCAGAAT 
GACTTGGTTGAGTACTCACCAGTCACAGAAAAGCATCTTACGGATGGCATGACAGTAAGAGAATTATGCAGTGCTGC 
CATAACCATGAGTGATAACACTGCGGCCAACTTACTTCTGACAACGATCGGAGGACCGAAGGAGCTAACCGCTTTTT 
TGCACAACATGGGGGATCATGTAACTCGCCTTGATCGTTGGGAACCGGAGCTGAATGAAGCCATACCAAACGACGAG 
CGTGACACCACGATGCCTGTAGCAATGGCAACAACGTTGCGCAAACTATTAACTGGCGAACTACTTACTCTAGCTTC 
CCGGCAACAATTGATAGACTGGATGGAGGCGGATAAAGTTGCAGGACCACTTCTGCGCTCGGCCCTTCCGGCTGGCT 
' GGTTTATTGCTGATAAATCTGGAGCCGGTGAGCGTGGCTCTCGCGGTATCATTGCAGCACTGGGGCCAGATGGTAAG 
CCCTCCCGTATCGTAGTTATCTACACGACGGGGAGTCAGGCAACTATGGATGAACGAAATAGACAGATCGCTGAGAT 
AGGTGCCTCACTGATTAAGCATTGGTAGGAATTAATGATGTCTCGTTTAGATAAAAGTAAAGTGATTAACAGCGCAT 
TAGAGCTGCTTAATGAGGTCGGAATCGAAGGTTTAACAACCCGTAAACTCGCCCAGAAGCTAGGTGTAGAGCAGCCT 
ACATTGTATTGGCATGTAAAAAATAAGCGGGCTTTGCTCGACGCCTTAGCCATTGAGATGTTAGATAGGCACCATAC 
TCACTTTTGCCCTTTAGAAGGGGAAAGCTGGCAAGATTTTTTACGTAATAACGCTAAAAGTTTTAGATGTGCTTTAC 
TAAGTCATCGCGATGGAGCAAAAGTACATTTAGGTACACGGCCTACAGAAAAACAGTATGAAACTCTCGAAAATCAA 
TTAGCCTTTTTATGCCAACAAGGTTTTTCACTAGAGAATGCATTATATGCACTCAGCGCAGTGGGGCATTTTACTTT 
AGGTTGCGTATTGGAAGATCAAGAGCAT CAAGTCGCTAAAGAAGAAAGGGAAACACCTAC TACTGATAGTATGCCGC 
" CATTATTACGACAAGCTATCGAATTATTTGATCACCAAGGTGCAGAGCCAGCCTTCTTATTCGGCCTTGAATTGATC 
ATATGCGGATTAGAAAAACAACTTAAATGTGAAAGTGGGTCTTAAAAGCAGCATAACCTTTTTCCGTGATGGTAACT 
TCACTAGTTTAAAAGGATCTAGGTGAAGATCCTTTTTGATAATCTCATGACCAAAATCCCTTAACGTGAGTTTTCGT 
TCCACTGAGCGTCAGACCCCGTAGAAAAGATCAAAGGATCTTCTTGAGATCCTTTTTTTCTGCGCGTAATCTGCTGC 
TTGCAAACAAAAAAACCACCGCTACCAGCGGTGGTTTGTTTGCCGGATCAAGAGCTACCAACTCTTTTTCCGAAGGT 
AACTGGCTTCAGCAGAGCGCAGATACCAAATACTGTCCTTCTAGTGTAGCCGTAGTTAGGCCACCACTTCAAGAACT 
CTGTAGCACCGCCTACATACCTCGCTCTGCTAATCCTGTTACCAGTGGCTGCTGCCAGTGGCGATAAGTCGTGTCTT 
ACCGGGTTGGACTCAAGACGATAGTTACCGGATAAGGCGCAGCGGTCGGGCTGAACGGGGGGTTCGTGCACACAGCC 
CAGCTTGGAGCGAACGACCTACACCGAACTGAGATACCTACAGCGTGAGCTATGAGAAAGCGCCACGCTTCCCGAAG 
GGAGAAAGGCGGACAGGTATCCGGTAAGCGGCAGGGTCGGAACAGGAGAGCGCACGAGGGAGCTTCCAGGGGGAAAC 
GCCTGGTATCTTTATAGTCCTGTCGGGTTTCGCCACCTCTGACTTGAGCGTCGATTTTTGTGATGCTCGTCAGGGGG 
GCGGAGCCTATGGAAAAACGCCAGCAACGCGGCCTTTTTACGGTTCCTGGCCTTTTGCTGGCCTTTTGCTCACATGA 
CCCGACA 



